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DRUG DISCOVERY METHOD 
FIELD OF THE INVENTION 

The present invention relates to methods of molecule affinity determination, for 
example, for use in discovering new drugs. 



The development of a new pharmaceutical, from conception to readiness for marketing, 
typically costs hundreds of millions of dollars and takes many years. The development process 
starts with a step of matching a molecule (a potential pharmaceutical) to a target, e.g., a protein 
in a human body or in a microorganism. The matching of a molecule to a pharmaceutical is 

10 known as a drug lead, as it may lead to the development of a drug. The molecule is then 
modified to be more active, more selective and more pharmaceutically acceptable (e.g., less 
toxic and more easily administered). The failure rates at these stages are very high. 

With the development of combinatorial chemistry and automated screening techniques, 
a new method of dmg discovery has been developed. In this new method, a large library of 

15 molecules is chemically tested against a target, with the molecule having a best match being 
used as a starting point for finding a lead and/or as a lead. Some of these libraries are 
constructed empirically, for example, based on available molecules and/or molecules known to 
act as pharmaceuticals. Other libraries are constructed to have a wide a range as possible of 
different molecules. Other libraries are constructed so thdt individual molecules will have as 

20 great a chance as possible in matching a target. In general, molecules are selected to be as 
diverse as possible and to be drug like (e.g., size, chemical behavior) so that if a match is 
found it can serve as a lead. 

Some references to such libraries and/or other discovery methods include, Pickett S.D. 
at al., J. Chem. Inf Comput. Sci. 36(6), p.1214-23 (1996) and Ferguson A.M. et al., J. Biomol. 

25 Scr. 1(2), p. 65 (1996), Bunin A.B. et. al., Proc. Natl. Acad. Sci. USA 91, p. 4708-12 (1994), 
EUman J. et. al., Proc. Natl. Acad. Sci. USA 94, p. 2779-82 (1997) and Maly D.J. et. al., Proc. 
Natl. Acad. Sci. USA 97(6), p. 2419-24 (2000), the disclosures of which are incorporated 
herein by reference. 



30 model of the target is generated (e.g., x-ray crystallography, estimated tertiary layout, analogy). 
Then, the affinity of a large number of molecules is determined by calculating docking 
behavior of a model of the molecule in the model of the target. Due to the relatively primitive 



5 



BACKGROUND OF THE INVENTION 



Another, virtual, stracture based, type of screening is known, hi the virtual method, a 
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State of molecular modeling and the resulting lack of availability of models, this method is not 
currently very successful. 

Sunesis, inc., in DJ Maly et al PNAS 97(6), p 2419-24(2000), the disclosure of which 
is incorporated herein by reference, suggest using large fragments of molecules as leads and 
then linking together such matching leads that are found into larger leads that are tested again 
for matching. The fragments are provided with pre-defined linkers, for the Unking together. 

PCT appUcation PCT/US99/06734 (WO 99/49314), the disclosure of which is 
incorporated herein by reference, also describes a schome of using fragments, and then linking 
the fragments to provide leads. 

SUMMARY OF THE INVENTION 

An aspect of some embodiments of the invention relates to a target characterization 
method, in which a plurality of small, measurement molecules interact with a target and the 
target is characterized based on an analysis of the interactions of the measurement molecules 
with the target. In an exemplary embodiment of the invention, none of the measurement 
molecules is used as a lead or as a fragment of a lead, nor are the molecules selected for 
interaction based on their dmg-type diversity. Rather, the measurement molecules are selected 
based on their expected ability to measure various chemical and/or physical dimensions of the 
target. In an exemplary embodiment of the invention, while the number of measurement 
molecules is relatively small (e.g., <106), this number spans the space of characterization of 
the target molecule and can suflBce to provide a relatively complete characterization of the 
target. In other embodiments, only a partial characterization is needed and/or obtained. 
Alternatively or additionally, while the measurement molecules are selected for span reasons, 
they are also xised as leads or as fragments of a lead. 

In an exemplary embodiment of the invention, a complete process of dmg discovery 
comprises: 

(a) selecting a target; 

(b) optionally selecting a set of measurement molecules usefiil for the target, or using a 
universal librarjr, 

(c) characterizing the target using the set of measurement molecules; 

(d) reconstructing a pharmaceutical model of the target, based on the characterization; 

and 

(e) usmg the model to forward a discovery process, for example, select, reject, filter 
and/or design a dmg lead. 
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In some embodiments of the invention, a typical measurement molecule can make one 
of several measurements, and a processing method, for example clustering, is optionally used 
to ^tract the particular measurements made by the molecules. 

In an exemplary embodiment of the invention, the measurement molecules are a set of 
5 chemical gauges, of which some, typically a small number, bind to the target, typically at one 
or more active sites of the target. The binding of a gauge to the target can be determined using 
various assay methods, including substantially any of those known in the art, for example, by 
detecting a change in the chemical or biological behavior of the target or by detecting a 
reduction in the number of free gauge molecules in a sample. In a particular example, a 

10 functional assay for a protease (e.g., of an HIV protein) comprises linking a fluorescent 
molecule onto a protein (or other peptide). The protease is allowed to interact with a gauge, 
and this interaction is expected to reduce or coimteract (or enhance) its affinity for the protein^ 
which change in affinity may be determined by measuring the fluorescent properties (e.g., 
polarization) of the mixture of protein and protease. In an exemplary embodiment of the 

15 invention, each gauge is selected to have an affinity to one or more particular geometric 
layouts. In an exemplary embodiment of the invention, the total geometry of a target area is 
reconstructed from the determination of affinity (and/or lack of affinity) of a plurality of 
gauges. 

In an exemplary embodiment of the invention, each of the gauges is constructed from a 
20 scaffold to which a plurality of particular chemical moieties are attached. Three such moieties 
define a triangle of moieties which includes both a definition of tiie moieties at the vertexes 
and the distance between the vertexes. In an exemplary embodiment of the invention, the 
scaffolds and moieties are selected so that the triangles are relatively rigid, however, some 
degree of play in the length of the triangle sides (inter-moiety distances) may be desirable. 
25 Each such moiety triangle matches a particular spatial layout of three binding sites that 

match the moieties. Optionally, the distance between the moieties is varied for different 
gauges, so that a range of triangles with various desired combinations of moieties and 
distances between the moieties is provided. As will be shown below, a gauge library that 
includes a spanning set of such triangles, both with regards to distance and with regards to 
30 moiety is not prohibitively large. 

In an exemplary embodiment of the invention, the scaffold and/or the moieties are 
selected to have a minimum flexibility, so that they more specifically define the geometric 
features that they match. 
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Optionally, the scaffolds and/or the moieties are selected to have a low molecular 
weight, so as to improve linking of low affinity gauges and/or targets and possibly provide 
information for such cases. 

In an exemplary embodiment of the invention, when selecting gauges for a 
5 measurement library, some degree of overlap of moiety triangle is provided. For example, an 
repetition overlap factor of 2 or 3 maybe provided (e.g., each triangle appears in at least 2 or 3 
gauges). This is expected to increase the probability of finding a triangle that binds, especially 
in view of problems which may occur such as steric clashes, chemical mismatch and/or 
solubility. Typically, an exact repetition of the moiety triangle is not available, so a nearly 

10 similar triangle is used for providing the overlap. In some cases, the triangles are selected so 
that for at least some pairs of moieties on the target, a triangle with a smaller distance between 
the same moieties and a triangle with a larger distance between the same moieties are both 
available for bmding. This provides a non-repetition overlap factor. Alternatively to 2 or 3, a 
lower or higher overlap factor, for example 4 or 6, and/or possibly a firactional factor (e.g., an 

15 average overlap), may be used. The overiap may be uniform on the library, or a greater overlap 
may be provided for some triangles and/or molecules, for example for molecules where there 
is a greater probability of steric clashing due to the scaffold and/or other moieties, or based on 
experimental results which indicate that certain gauges and/or triangles are difficult to bind. 

It should be noted that if a molecule is required to distort in order to bind, its likeUhood 

20 of binding is typically lower. Thus, the actual overlap between two dissimilar triangles of two 
gauges may be non-uniform and dependent on the total binding probability. In general, if a 
probability of discovery of biding in an assay is negligible, it is assumed that the gauge does 
not bind. This helps define the range of distortion that can be used to define coverage and 
overlap. In some embodiments of the invention, the molecules are substantially rigid, so the 

25 cut-off of degree of distortion is more clearly defmed and limited. 

A particular exemplary drug discovery process in accordance with an exemplary 
embodiment of the invention, is as follows: 

(a) Synthesize a library of small molecules designed to span all possible 3-point 
pharmacophores (all combinations of 3 elementary chemical moieties and distances between 

30 them). This is a finite library which may include, for example -100,000 compounds. This is 
termed a USL (Universal Screening Library), due to its generalized nature of ability (e.g., in 
some embodiments of the invention) to be used for mapping a wide range of targets for which 
small molecule dmgs are designed. 
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(b) For any target, screen the USL against that target, looking for weakly active 
compounds (affinity of -100 microM). Theoretical considerations and experimental data 
indicate that 100-1000 hits should be expected for any target. 

(c) Computationally analyze the active molecules, seeking: 

5 1 . 3-Point-Pharmacophores (3PP 's) involved in binding of the hits. 

2. Reconstruction of the binding-site topography in terms of chemical 
moieties involved in binding. Generate the complete pharmacophore (--10-20 points) of the 
binding-site, 

(d) Computationally identify molecules that may compliment a large enough (e.g., 
10 6-8 points for nanoMolar binding) subset of the full pharmacophore. Optionally, by knowing 

which parts of these molecules are not directly involved in binding, design them to meet 
predefined dmg-like qualities (e.g. using lipinski's rules of 5). 

(e) Using well known chemical knowledge, chose those molecules most amenable 
to synthesis and other considerations (e.g., toxicity) and synthesize those as possible drug 

15 candidates. 

(f) Testing and iterations. 

An aspect of some embodiments of the invention relates to estimating a spatial layout 
of binding locations in a target molecule. In an exemplary embodiment of the invention, the 
binding of a plurality of small molecules to the target is determined, for example using assay 

20 methods. In an exemplary embodiment of the invention, the small molecules are selected to 
have or are each modeled as a set of geometrical sub-structures which may, on its own, bind to 
the target. In one example, the geometrical sub-structure may be three moieties arranged in a 
triangle. In an exemplary embodiment of the invention, the assay results are analyzed to 
determine which of the many geometrical sub-structures in the small molecules, actually bind 

25 to the target molecule. In an exemplary embodiment of the invention, a clustering method is 
used to determine which geometrical sub-stmctures bind, by clustering together molecules that 
bind and that have similar geometrical sub-stractures. The output of the clustering method may 
be a hst of all the probably binding sub-stmctures. Optionally, the sub-structures used for 
analysis and for design of the gauges is triangular. 

30 In an exemplary embodiment of the invention, a score based method is used to convert 

a Ust of geometric sub-stmctures (e.g., triangles) into a complete geometric structure, by: 

(a) generating possible structures fi-om the list of sub-structures; 

(b) associating a "correctness" score with each stmcture; and 
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(c) selecting between structures based on their score. 

In an exemplary embodiment of the invention, the score represents the probability of 
two sub-structures sharing a portion in the structure and, optionally, a higher score is provided 
for a structure in which a portion is shared, as that represents a more cohesive structure. 
5 Alternatively or additionally, the score represents the probability of two different moieties 
binding to a same binding location, and, optionally, a higher score provided if more moieties 
share a same binding site, as this represents a minimization of pharmacophore points to the 
minimum required. Other heuristic rales may be used as well. 

hi an exemplary embodiment of the invention, the set of all potential models is not 
10 actually built. Instead a search is made of the space of models and the models are built (and/or 
rejected) ad-hoc based on the detemiined sub-structures. 

In an alternative embodiment of the invention, a clustering method is used, comprising 
for example: 

(a) generating (all) possible structures from the foimd triangles, optionally using 
1 5 particular construction mles ; 

(b) finding the most common large sub-structures that are shared by multiple 
structures; and 

(c) selecting a particular common sub-structure, optionally using a scoring method, 
such as cluster size, edge size and thresholding of cluster size, possibly selecting a most 

20 common substracture from all those that pass a certain threshold. In some cases, more than one 
final resulting sub-structure will be provided. 

It should be noted that an actual pharmacophore may not be a limited size and strictly 
defined entity, for example, a point that is technically outside the active area, can act as a 
pharmacophore if a small molecule drug binding to that point includes a tail that blocks the 

25 active area fi-om interacting with the substrate. Often however, the "relevance" of a binding 
area will decrease as the area is further away from an active area, a control area and/or a 
confomiance changing area. In addition, the binding affinity of a protein is often significantly 
smaller away from such areas. 

Li an exemplary embodiment of the invention, the stmctures for clustering are 
30 generated in the following manner: 

(a) a triangle is selected as a base sub-structure; 

(b) a point is added to the base sub-structure, if there are two triangles that, together 
with a triangle on the sub-stracture, define a tetrahedral; and 
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(c) (b) is Treated until there are no unused triangles left to add. 
An aspect of some embodiments of the invention relates to finding one or more 
molecules (e.g., a drug lead) that is expected to match a target, from a plurality of geometric 
and/or chemical measurements of the target area. The measxirements are optionally used to 
5 generate a reconstruction model of the target, against which model various processing methods 
may be applied, for example using suitable computer hardware or software. In an exemplary 
embodiment of the invention, the measurements are provided by interacting the target with a 
plurahty of gauge molecules and determining the degree of binding of the gauge molecules to 
the target. For example, a set of triangular geometries is determined by gauge matches and is 
1 0 correlated to recreate a three-dimensional model of the target area. 

Optionally, the target area is compared to structures of known pharmaceuticals or 
pharmaceutical like materials, for example a dmg lead library. Alternatively or additionally, 
the target area geometry is used to select a most likely candidate from a relatively small 
pl\irality of materials. Alternatively or additionally, the matching is used during the process of 
1 5 drug development, to select or reject modifications of drug leads, which do or do not match the 
target area geometry. 

In a particular example, if one wants to satisfy Lipinski's rules by adding or subtracting 
H-bond donors/acceptors, knowing which ones are important for binding would indicate which 
ones not to remove, and knowing which parts of the molecule are not important would indicate 

20 where additions can be made without hurting the binding. 

An aspect of some embodiments of the invention relates to a library of gauges for 
measuring a biochemical target. In an exemplary embodiment of the invention, the library 
comprises a large number of molecules constructed by attaching moieties on a relatively small 
number of scaffolds. In an exemplary embodiment of the invention, the moieties are selected 

25 to have as low a molecular weight as possible. Altematively or additionally, the Ubrary is 
designed to cover, in a desired manner, a set of parametrically defined geometric sub- 
structures. Possibly, the geometric sub-structures are triangles, with different moieties at their 
vertexes. Li one example, the range of different triangle dimensions is evenly covered. 

In an exemplary embodiment of the invention, the Ubrary is selected to provide same 

30 (overlapping) geometric sub-structures based on several scaffolds and/or in several molecules, 
for example, each sub-structure being provided twice or thrice. Optionally, the overlapping is 
designed to take mto account steric clashes and/or different chemistries of different scaffolds 
and/or gauges. 
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Jn an exemplary embodiment of the invention, die scaffolds used include at least two, 
at least five, at least seven, at least 10 or any greater or intermediate number, such as at least all 
of the following scaffolds: mono-caibone; pyrrole; quinoline pyrazinoquinazoUne; 
isoindoloindole; isoindoloindole with an oxygen moiety attached; indolo[2,3-b]quinoline; 
pyrrolizine; 2,2'-bipyrrolone; indolizine; Thiophene; IH-Pyrrole; Furan; Benzene; Pyridine; 
Pyrimidine; Pyrazine; 6H-Thieno[2,3-b]pyrrole; l,6-Dihydro-pyirolo[2,3-b]pyrrole; IH- 
Ihdole; Thieno[2,3-d]pyrimidine; 6,7-Dihydro-pyrazolo[l,5-a]pyrimidine; Quinoline; 
Isoquinoline; Quinoxaline; 3,4-Dihydro-benzo[e][l,4]diazepin-5-one; 3,8-Dihydro-4H- 
pyiiolo[2,3-e][l,4]diazepin-5-one; 3,4-Dihydro-thieno[2,3-e][l,4]diazepin-5-one; 3,6- 
Dihydro-4H-pyrrolo[3,2-e][l,4]diazepin-5-one; 5H,1 lH-Dibenzo[b,f|[l,5]diazocine-6,12- 
dione; l,4-Dihydro-10H-l,4,10-l,4,10-triaza-benzo[a]cyclopenta[e]cyclooctene-5,ll-dione; 
4H,1 OH-1 -Thia-4,1 0-diaza-benzo[a]cyclopenta[e]cyclooctene-S, 1 1 -dione; Dipyrrolo[ 1 ,2- 
c;2',l'-e]imidazol-5-one; l,4,7,9-Tetrahydro-l,4,6,9-tetraaza-dicyclopenta[a,e]cyclooctene- 
5,1 0-dione; 4,7,9-Trihydro-l -thia-4,6,9-triaza-dicyclopenta[a,e]cyclooctene-5, 1 0-dione; 

2,4,9,Trihydro-llambda*4*,6-dithia-4,9-diaza-dicyclopenta[a,e]cyclooctene-5 , 1 0-dione; 6,9- 
Dihydro-5H-l-thia-5,8,9,triaza-cyclopenta[a]azulen-4-one; 3,10,Dihydro-4H- 
[l,4]diazepino[5,6-b]indol-5-one; 3,6-Dihydro-4H-[l,4]diazepino[6,5-b]indol-5-one; 7,8- 
Dihydro- IH- 1 ,7, 1 0-triaza-cyclohepta[e]inden-6-one; 8,9-Dihydro-3H-3,6,9-triaza- 
cyclohepta[e]mden-10-one; 7,8-Dihydro-lH-l,5,8-triaza-cyclohepta[f|inden-9-one; 8,9- 
Dihydro-5,6,9,ll-tetraaza-cyclohept[b]n^hthalcne-10-one; 3,4-Dihydro-[l,4]diazepino[5,6- 
b]quinolin-5-one; 8,9-Dihydro-4,8,ll-triaza-cyclohq)ta[a]nq)hthalene-7-one; 1 lH-10,1 1- 
Diaza-benzo[b]fluorine; a-hydroxyacids; a-aminoacids; cohels; Bicyclo[2.2.2]octane; 2- 
Mediylene-2,3-dihydrobenzo[ 1 ,4]dioxine; 6,7-Dihydro-2H-pyrazino[ 1 ,2-a]pyramidine; 9H- 
Fluorene; l,4-Diaza-bictclo[2.2.2]octane; l-Aza-bicyclo[2.2.2]octane; Pyrido[2,3- 
d]pyrimidine; 5-Methylene- 1 ,5-dihydro-pyrrol-2-one; Bezno[4,5]imidazo[ 1 ,2-a]pyriimdine; 
1 ,4-Dihydro-benzo[4,5]imidazo[ 1 ,2-a]pyrimidine; 4, 1 0-Dihydro- 1 ,4a, 1 0-triaza-phenanthren-9- 
one; l,5-Dihydro-imidazo[l,2-a]pyrimidin-2-one; l,2,3,5-Tetrahydro-imidazo[l,2- 
a]pyriinidine; Thiazolo[3,2-a]thieno[2,3-d]pyrimidin-5-one; l,9-Dithia-4a,10-diaza- 
cyclopenta[b]fluoren-4-one; 5,6-Dihydro-l-thia-5,7,8,9a-tetraaza-cyclopenta[e]azulen-4-one; 
6,10-Dihydro-5H-l-thia-5,7,10a-triaza-benzo[e]azulen-4-one; 4,5-Dihydro-3-thia-4,5a,10- 
triaza-cyclopenta[a]fluorine; 8H-l-Thia-cyclopenta[a]indene; 3-Thia-4,5a,10-triaza- 
cyclopenta[a]fluorine; 6,7,9,ll-Tetrahydro-10-thia-6,9-diaza-indeno[l,2-a]azulene-5,8-dione; 
2,3,6,7, 12a-Hexahydropyrazino[ 1' ,2' : 1 ,6]pyrido[3,4-b]indole-l ,4-dione; 5, 1 0-Dihydio-4H- 

8 
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2,3a,10-tria2a-cyclopeiita[a]fluorme; 5H.Pyrido[4,3-b]indole; llH-Indolizino[l,2-b]quinolin- 
9-one; l,2-Dihydro-2,4a,9rtriaza-anthracene-3,10-dione; 6H-Isomdolo[2,l-a]mdole; 1,5- 
Dihydro-benzo[b][l,4]dia2epin-2-one; 5,10-Dihydro-dibenzo[b,e][l,4]diazepin-ll-one; 5,11- 
Dihydro-beii2o[e]pyrido[3,2-b][l,4]dia2epin-6-one; 4,9-Dihydro-3-thia-4,9-diaza- 
5 benzo[flazulen-10-one; Benzo[g]quinoxaline; Pyrazino[2,3-b]quinoxaline; Pyrido[2,l- 
blquinazolin- 1 1-one; 1 -Thia-4a,9-diaza-cyclopenta[b]naphthalene-4-one; 2-Methylene-4H- 
benzo[ 1 ,4]thia2in-3-one. 

In general, the greater the number of scaffolds, the easier it is to find right sizes of 
gauges and also deal with a wider range of steric clash conditions and/or different chemistries. 
10 On the other hand, smaller niunber of scaffolds, promotes uniformity of chemical behavior and 
synthesis methods. 

In an exemplary embodiment of the invention, the moieties used include, at least 2, at 
least 4, at least 6, or any greater number, such as all of the following moieties: Me, Et, Pr, Ph, 
CO2H, OH, NH2> ketone, halides, such as 01 or Br, other acids such as SO3H, PO3H2, and 
1 5 NH-C=NH(-NH2) (Guanidine). 

In general, using more moieties may provide greater accuracy in characterizing binding, 
at a possible expense of library size. Using fewer moieties may also simplify synthesis 
methods. 

An aspect of some embodiments of the invention relates to selecting a gauge Ubrary for 
20 use in characterizing a target. In an exemplary embodiment of the invention, a range of 
dimensions of target geometries is estimated, as well as bond types of binding locations. A set 
of molecules that spans the range of possible sizes and bond types is selected firom a larger 
available set of molecules. The selection may be, for example, electronic with selected 
molecules being synthesized in response to selection or the selection is physical, with the 
25 gauge molecules already available. Optionally, the estimation uses various information known 
about the target. Alternatively or additionally, the estimation is made using a first screening 
library, that is, for example, more flexible in the afiBnity of its bond types and/or uses 
molecules that are more flexible. 

Optionally, the gauges are selected so that the library will have considerable repetition, 
30 for example to overcome steric clashes and/or oflier properties of the molecules, that might 
prevent binding. Optionally, the library includes at least one, or possibly more than one multi- 
point binding geometries, for at least some of the physical geometries, for example, triangles 
and pentagons. 
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la accordance with exemplary embodiments of the invention, such a library can be used 
on its own or as part of a different library for various uses. In an exemplary embodiment of the 
invention, such a spanning library is used to increase the probability of binding of any of the 
gauges in the library to the target, desirably, a considerable number of gauges. It is noted that a 
5 standard lead library often provides no bindings at all. Optionally, the bindings results are used 
to gather information about the target, especially statistical information. Optionally, the 
statistical information is used to provide structural information about the target. Optionally, the 
structural information comprises a chemical and/or geometrical structure of a significant part 
of the target, for example, an active area thereof It should be noted that in an exemplary 
10 embodiment of the invention, once even a single binding is found, useful information about 
the target is available and any library that assists in guaranteeing this binding has a use. 

An aspect of some embodiments of the invention relates to designing and/or creating a 
gauge library for use in characterizing target molecules by geometrical and/or chemical 
measurements* 

15 In an exemplary embodiment of the invention, library construction comprises: 

(a) identifying molecules that may be suitable as gauges; 

(b) determining if the identified molecules provide required gauges; and 

(c) verifying that the molecules are realistic, for example being readily synthesizable 
and/or having desirable chemical behavior. It should be noted that this order is flexible, for 

20 example as shown below. 

In one example, this method is used when basing at least part of a gauge library on 
existing libraries. In some libraries, (c) is ateady performed when the library is originally 
composed. Further, in some case, rather than select molecules, known existing binding results 
of certain molecules are used as input, instead of selecting a gauge and physically testing the 

25 binding affinity. 

Alternatively, candidate gauges may be provided as a group, for example, when a new 
scaffold is added to a Ubrary. A large number of candidates then arise, as attachments of 
different moieties to the scaffold. Jn this case, however, an opposite step may be taken - a 
scaffold may be rejected because it does not add any (or enough) gauges that do not overlap 

30 with existing gauges. For some parts of the spanned space, scaffolds that generate few gauges 
maybe suitable. 
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Id an alternative method, chemical design methodology is £q)plied to design gauges 
and/or scaffolds that have desired properties and/or geometries, for example, to fill in missing 
parts of a measurement space. 

Li an exemplary embodiment of the invention, one or more of the following are 
5 considered to be desirable properties of gauges, however, a gauge need not have all or even 
any of the following properties, in order to be useful for some embodiments of the invention: 

(a) High rigidity. This may allow measurements to be more exact, however, a small 
degree of flexibility may be desirable, to allow complete coverage of all the space. Rigid 
means that the length and/or relative angles of the bonds do not change a significant amount. 
10 (b) Low mass. This may increase the chance of bonding even if affinity is low and only 

three points on the gauge bind. 

(c) Small size. This may allow targets to be more easily measured and steric clashes 
more easily avoided. 

(d) Non-toxic. This may allow the use of the gauge in living cells. However, due to the 
1 5 differing sensitivity of different cells, this often cannot be ensiired. 

(e) Good chemical behavior. This means that the gauge is soluble and binds under 
conditions that do not distort the gauge, or distort it by a known amount. 

(f) Strong binding. This means in one embodiment of the invention, for example, 1-100 
micromolar, which is usefiil for example if solubility is low or toxicity is high. 

20 hi an exemplary embodiment of the invention, one or more of the following are 

considered to be good properties of scaffolds, however, a scaffold need not have all or even 
any of the following properties, in order to be usefiil for some raibodiments of the invention: 

(a) Easy to attach moieties (e.g., synthesize gauges) and obtain pure solutions of 
particular gauges. 

25 (b) Provide a wide range of sizes. 

(c) Have many (e.g., >3, better >4, >5) attachment points. While every hydrogen atom 
in a molecule is potentially an attachment point, in an exemplary embodiment of the invention, 
a usefiil attachment point is accessible for chemical manipulation. 

(d) What (relatively rare in other gauges) chemistries possibilities and/or gauge sizes 
30 are added to the library, by inclusion of the scaffold. 

(e) Allow attachment of various combinations of moieties, as not all combinations will 
work with all scaffolds. 

11 
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In an exemplary embodiment of the invention, one or more of the following are 
considered to be desirable properties of a gauge library: 

(a) Spanning of a range of distances between bonds. 

(b) Chemical spanning. At points on opposite ends of bonds, a wide range of moieties 
5 are provided. 

(c) Sub-structure spanning. For the sub-structiure selected, e.g., a triangle, all possible 
triangle conjBgurations in a target can bind to at least one gauge in the library. 

(d) Small. The smaller the library the better. For practical reasons, the Ubrary cannot be 
too small, however, very large libraries are generally not necessary. 

10 (e) Variations of gauge properties within library to match the density of gauge 

coverage, for example, less rigid bond lengths to cover missing or spaced apart bonds. 

(f) Uniform coverage. Various types of uniformity may be provided, for example, 
uniformity in absolute sizes or imiformity corrected for chemical dependencies. For example, 
the density of distances for short bond lengths will be higher than for long bond lengths, to 

15 provide a same normalized density for different lengths. 

(g) Degree and type of overlap. While more overlap is generally better for 
reconstruction and chemical generalization, it often comes at a cost of library size and cost. An 
overlap of three (e.g., each triangle is provided in three gauges) is an exemplary compromise. 

In general, however, the desirable properties may depend on the target, enviroimaent 
20 and/or type of discovery method being applied. In particular, it is noted that in some cases, the 
generated library is only partial, for example spaiming only a part of the space, being suitable 
for only part of a target, being in a lower resolution, having less (or no) overlap and/or being 
prone to fail for some types of targets. 

A broad aspect of some embodiments of the invention relates to molecules, such as 
25 gauges and scaffolds and methods of synthesis thereof, which may find use for libraries in 
accordance with exemplary embodiments of the invention. 

There is thus provided in accordance with an exemplary embodiment of the invention, 
a method of obtaining information about a chemically active area of a target molecule, 
comprising: 

30 providing a set of substantially rigid chemical gauges; 

reacting said target with a plurality of gauges of said set of gauges; 

assaying a binding of said gauges with said target to obtain a plurality of assay results; 

and 
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analyzing said assay results to obtain infomiation about said chemically active area. 
Optionally, said gauges allow rotation of moieties of said gauges. Alternatively or additionally, 
said gauges are constructed using a rigid sca£fold. 

In an exemplary embodiment of the invention, constituent atoms of said gauges do not 
move more than 1 A unless at least 20Kcal/Mol are applied to the gauge. 

In an exemplary embodiment of the invention, analyzing comprises identifying a 
plurality of spatial and chemically specific bindings configurations in said target active area. 
Optionally, said configurations comprise triangular configurations. Alternatively or 
additionally, identifying comprises identifying a configuration that matches a configuration of 
a bound gauge. Alternatively or additionally, identifying comprises identifying a configuration 
that does not match a configuration of a boimd gauge. Optionally, identifying comprises 
identifying by statistical analysis of said assay resxilts. Optionally, identifying comprises 
identifying by clustering. 

In an exemplary embodiment of the invention, identifying comprises assimiing each 
gauge indicates a single configuration. Alternatively or additionally, identifying comprises 
assuming at least some of the gauges indicate a plurality of configurations. Altematively or 
additionally, identifying comprises classifying gauges by chemical moieties at vertexes of said 
configurations. 

In an exemplary embodiment of the invention, the method comprises reconstructing a 
spatial map of at least part of said chemically active area, from at least two of said assay 
results, said part including at least four chemical binding areas. Optionally, said part includes 
at least six chemical binding areas. 

In an exemplary embodiment of the invention, the method comprises reconstructing a 
spatial map of at least part of said chemically active area, fi-om at least two of configurations, 
said part including at least four chemical binding points. Optionally, said part includes at least 
six chemical binding areas. 

In an exemplary embodiment of the invention, reconstructing comprises: 

test-reconstructing a pluraUty of spatial maps from said configurations; 

scoring said maps; and 

selected a spatial map based on its score. Altematively or additionally, reconstracting 
comprises: 

test-reconstructing a plurality of spatial maps from said configurations; 
clustering said maps according to common substructures; and 

13 
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selected a spatial map based on a relative property of a cluster it belongs to. Optionally, 
said relative property comprises size. 

In an exemplary embodiment of the invention, said spatial map includes enough 
binding points to ensure binding of a small molecule drug having a chemical profile matching 
the binding points. Optionally, said spatial map includes at least 6 binding points. Optionally, 
said spatial map includes at least 8 binding points. 

In an exemplary embodiment of the invention, said set of gauges comprises a set of 
gauges with at least 10,000 gauges. Optionally, said set of gauges comprises a set of gauges 
with at least 50,000 gauges. 

In an exemplary embodiment of the invention, said gauges comprise moieties arranged 
in spatial configurations and wherein said gauges are selected to span a virtual space of spatial 
chiemical configurations. 

In an exemplary embodiment of the invention, substantially each point of virtual space 
that is panned by said gauges is covered by at least two gauges. Optionally, substantially each 
point of virtual space that is spanned by said gauges is covered by at least three gauges. 

In an exemplary embodiment of the invention, at least 0.5% of said gauges bind with 
said target. Optionally, at least 1% of said gauges bind with said target. Optionally, at least 3% 
of said gauges bind with said target. 

In an exemplary embodiment of the invention, at least 50% of said gauges are defined 
by adding moieties to a set of fewer than 100 scaffolds. Optionally, at least 50% of said gauges 
are defined by adding moieties to a set of fewer than 50 scaffolds. 

In an exemplary embodiment of the invention, at least said set of gauges uses fewer 
than 15 different chemical moieties to define the chemical behavior of said gauges. 

In an exemplary embodiment of the invention, , at least said set of gauges uses fewer 
than 10 different chemical moieties to define the chemical behavior of said gauges. 

In an exemplary embodiment of the invention, said assay is a fimctional assay. 
Altematively or additionally, said assay is a binding assay. Alternatively or additionally, said 
assay is a cellular assay. Altematively or additionally, said assay is a flow-through assay. 

In an exemplary embodiment of the invention, said fimctional assay is performed in the 
presence of a natural substrate of said target.. 

In an exemplary embodiment of the invention, said target comprises a protein including 
a biochemically active area adapted to engage a substrate. Optionally, said chemically active 
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area comprises an area including said biochemically active area. Alternatively or additionally, 
said chemically active area comprises a control area of said protein. 

Li an exemplary embodiment of the invention, analyzing comprises analyzing 
successfixl binding of at least 60 gauges. Alternatively or additionally, analyzing comprises 
analyzing successful binding of at least 10 gauges. Alternatively or additionally, analyzing 
comprises analyzing successful binding of at least 100 gauges. 

In an exemplaiy embodiment of the invention, identifying comprises identifying at 
least 40 different configurations. Alternatively or additionally, identifying comprises 
identifying at least 10 different configurations. Altematively or additionally, identifying 
comprises identifying at least 100 different configurations. 

In an exemplary embodiment of the invention, the method comprises: 

comparing said map to a lead data base; and 

selecting a lead fix>m said data base for further use responsive to a semblance or lack of 
semblance between said lead and said map. 

Altematively or additionally, the method comprises: 
comparing said map to a lead data base; and 

rejecting a lead fi-om said data base for further use responsive to a semblance between 
said lead and said map. 

Altematively or additionally, the method comprises: 

constmcting a lead to have a semblance to said map. Optionally, constructing 
comprises constructing using said gauges or scaffolds used to define said gauges. 
In an exemplary embodiment of the invention, the method comprises: 
comparing said configurations to a lead data base; and 

selecting a lead firom said data base for further use responsive to a matching of said 
configurations to said lead. 

In an exemplary embodiment of the invention, the method comprises constructing a 
lead based on said configurations. 

In an exemplary embodiment of the invention, the method comprises selecting at least 
one of said gauges as a lead for drug discovery. 

In an exemplary embodiment of the invention, the method comprises comparing the 
binding of gauges with similar binding geometries to obtain steric clashing data; and 

analyzing said steric clashing data to provide geometrical inforaiation about said target. 
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There is also provided in accordance with an exemplary embodiment of the invention, 
a method of identifying the existence of a plurality of chemical-spatial configurations in a 
target, comprising: 

assaying the target with a pluraUty of gauges having know chemical-spatial 
configurations at vertexes thereof, to provide a plurality of assay results; 

defining an array of spaces, one q)ace for each set of chemical behaviors of the 
vertexes of each configuration; 

indicating said results according to said spaces, to generate clusters; and 

identifying the existaice of a configuration in said target &om said clusters. Optionally, 
mdicating comprises spreading an indication responsive to a spreading fimction. Optionally, 
said spreading fimction is dependent on an estimated energy of binding of a gauge to said 
target. 

There is also provided in accordance with an exemplary embodiment of the invention, 
a method of reconstructing a spatial shape of a chemical binding configuration of a target from 
a set of sub-shapes, each of which indicates a part of said binding configuration, comprising: 

selecting a base from said sub-shapes; 

selecting at least two sub-shapes having the property that they match each other at least 
along one side thereof and match said base along another side thereof 
accumulating said sub-sh^es to said base; and 

repeating said selecting and said accumulating until all of said sub-shapes are used or 
cannot be used, thereby providing a shape of a binding configuration of said target. Optionally, 
the method comprises variationally repeating said selecting, accumulating and repeating using 
a different order of selection of sub-shapes. Optionally, the method comprises repeating said 
selecting a base and said variationally repeating for a plurality of different base selections. 
Optionally, the method comprises clustering a plurality of such shapes according to shared 
sub-component shapes. Optionally, the method comprises selecting a sub-component sh^e as 
a resulting shape based on said clustering. 

In an exemplary embodiment of the invention, said sub-shapes comprise triangles. 
Altematively or additionally, said sub-shapes define chemical behavior at their vertexes and 
wherein two sides are said to match if the chemical behavior at their vertexes match. 

In an racemplary embodiment of the invention, two sides are said to match if flieir 
length is similar. 
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There is also provided in accordance with an exemplary embodiment of the invention, 
a method of selecting a scaffold for use in generating a part of a screening library, comprising: 

providing a potential scaffold molecule including a plurality of possible attachment 
points for moieties; 

determining a rigidity of the molecule; and 

rejecting said potential scaffold molecule responsive to a lack of rigidity of said 
scaffold. Optionally, said lack of rigidity is absolute. Altematively, said lack of rigidity is 
relative to other potential scaffolds. 

In an exemplary embodiment of the invention, the method comprises selecting a 
scaffold based on a nimiber of rings thereof 

Iq an exemplary embodiment of the invention, the method comprises: 

detOTnining a plurality of gauge molecules that can be generated by adding moieties to 
said potential scaffold molecule; 

determining for an existing library portion what spatial chemical configurations are 
added by said molecules; and 

selectiag said potential scaffold molecule if one or more significant spatial chemical 
configurations can be added by it to said library portion. Optionally, the method comprises 
selecting a scaffold based on a number of configurations added by said scaffold. Altematively 
or additionally, said significant spatial configurations are configurations not previously 
provided or overlapped with. 

There is also provided in accordance with an exemplary embodiment of the invention, 
a method of selecting a gauge molecule to be added to a screening library, comprising: 

providing a set of chemical molecules and at least a part of a screening library; 

selecting a potential gauge molecule firom said set of chemical molecules; 

determining a rigidity of said potential gauge molecule; and 

rejecting said potential gauge molecule responsive to a lack of rigidity of said gauge 
molecule. Optionally, said lack of rigidity is absolute. Altematively, said lack of rigidity is 
relative to other potential scaffolds. 

In an exemplary embodiment of the invention, the method comprises: 

determining a spanning, in chemical configuration space, of said part of a screening 

library; 

detennining at least one spatial chemical configuration of said potential molecule; and 
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selecting said potential gauge molecule if it adds at least one significant spatial 
chemical configuration to said screening library. 

Optionally, providing a set of molecules comprises generating said molecules using a 
single scaffold to which moieties are selectively attached. Alternatively or additionally, 
providing a set of molecules comprises providing a chemical library. 

In an exemplary embodiment of the invention, said gauge is selected if it adds at least 
one spatial chemical configuration not previously provided or overlapping a provided 
configuration. 

There is also provided in accordance with an exemplary embodiment of the invention, 
a method of creating at least aportion of a screening library, conq)rising: 
selecting a scaffold molecule to which moieties can be added; 

determining a pluraUty of potential gauges which can be created by attaching moieties 
to said scaffold; and 

selecting a subset of said gauges that do not substantially overlap in chemical 
configurations. Optionally, the method comprises rejecting potential gauges that add over six 
spatial chemical configurations. 

There is also provided in accordance with an exemplary embodiment of the invention, 
a method of reducing a screening library, comprising: 

for each molecule in at least part of said library, determining substantially all the spatial 
chemical configurations of a certain order of binding points provided by the molecule; and 

removing a plurality of molecules which add redundant spatial chemical 
configurations. Optionally, said certain order is three. 

There is also provided in accordance witii an exemplary embodiment of the invention, 
a method of reducing a screening library, comprising: 

for each molecule in at least part of said library, calculating a binding probability of 
said molecules based on energetic considerations; and 

removing at least some molecules whose binding probability is below a threshold 
value. Optionally, said binding probabiUty is calculated using a formula which is inversely 
dependent on a flexibiUty of the molecule. Alternatively or additionally, said binding 
probability is at least estimated based on a solubility of the molecule. 

There is also provided in accordance with an exemplary embodiment of the invention, 
a method of designing a screening library for a projected target molecule task, comprising: 
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determining a desired range of distances between binding points to be directly 
identified by said library, 

determining a desired overlap between measures provided by gauge molecules of said 

library; 

5 determining a set of desired binding types to be discrinunated between; and 

generating a plurality of gauges, said gauges each defining a plurality of binding types 
and distances between them, such that said gauges cover a spatial chemical configuration space 
that includes said distances and said binding types with said desired overlap. Optionally, 
generating a plurality of moieties comprises g^erating by attaching moieties to scaffolds. 
10 Alternatively or additionally, said gauges cover a spatial chemical configuration space of 
triplets of binding points. Alternatively or additionally, said projected target molecule task 
comprises proteins. 

In an exemplary embodiment of the invention, said overlap is at least two. 
Alternatively said overlap is at least four. Alternatively, said overlap is at least six. 
15 In an exemplary embodiment of the invention, said gauges are substantially rigid. 

Altematively or additionally, said coverage takes into account an inherent flexibility of 
binding. 

In an exemplary embodiment of the invention, generating comprises generating 
substantially same configurations by different gauges, thereby providing at least part of said 
20 overlap. Optionally, generating comprises providing a repetition factor of at least two. 

In an exemplary embodiment of the invention, generating comprises generating 
substantially different configurations by different gauges, which different configurations 
overlap due to a degree of flexibiUty thereof, thereby providing at least part of said overlap. 

In an exemplary embodiment of the invention, the method comprises generating a set 
25 of dmg leads for said target based on said information. Optionally, the method comprises 
removing known drug leads for said target from said set. 

There is also provided for in accordance with an exemplary embodimrat of the 
invention, a lead set produced by one of the methods described above. 

There is also provided in accordance with an exemplary embodiment of the invention, 
30 a drug lead comprising: 

a plurality of substantially rigid scaffolds molecule sections; 

at least one link interconnecting said scaffold molecule sections; and 

a plurality of moieties attached to said scaffolds. 

19 
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ThCTe is also provided in accordance with an exenqilaiy embodiment of the invention, 
a screening library comprising: 

at least 10,000 molecules generated by attaching moieties to a set of fewer than 50 
scaffold molecules. Optionally, fewer than 20 scaffold molecules are used to generate said at 
least 10,000 molecules. Alternatively or additionaUy, said scaffolds include at least one of the 
following scaffold molecules: Thiophene; IH-Pyrrole; Furan; Benzene; Pyridine; Pyrimidme; 
Pyrazine; 6H-Thieno[2,3-b]pyirole; l,6-Dihydio-pyrrolo(2,3-b]pyrrole; IH-lhdole; 
Thieno[2,3-d]pyrimidine; 6,7-Dihydro-pyrazolo[l,5-a]pyrimidine; Quinoline; Isoquinoline; 
Quinoxaline; 3,4-Dihydro-ben2o[e][l,4]diazepin-5-one; 3,8-Dihydro-4H-pym)lo[2,3- 
e][l,4]dia2epin-5-one: 3,4-Dihydro-thieno[2.3-e][l,4]diazepin-5-one; 3.6-Dihydio-4H- 
pyrrolo[3,2-e][l,4]diazepin-5-one; 5H,1 lH-Dibenzo[b,£][l,5]diazocine-6,12-dione; 1,4- 
Dihydro-10H-l,4,10-l,4,10-tria2a-benzo[a]cyclopenta[e]cyclooctene-5,ll-dione; 4H,10H-1- 
Thia-4,10-diaza-ben2o[a]cyclopenta[e]cyclooctene-5,l 1-dione; Dipyrrolo[l,2-c;2%l '- 

e]imida2ol-5-one; 1 ,4,7,9-Tetrahydro-l .4,6,9-tetraa2a-dicyclopenta[a,e]cyclooctene-5, 1 0- 
dione; 4,7,9-Trihydro-l-thia-4,6,9-triaza-dicyclopenta[a,e]cyclooctene-5,10-dione; 
2,4,9,Trihydro-llambda*4*,6-dithia-4,9-diaza-dicyclopenta[a,e]cyclooctene-5,10-dione; 6,9- 
Dihydro-5H-l-thia-5,8,9,triaza-cyclopenta[a]a2ulen-4-one; 3,10,Dihydro-4H- 
[l,4]dia2epino[5,6-b]indol-5-one; 3.6-Dihydro-4H-[l,4]diazepino[6,5-b]indol-5-one; 7.8- 
Dihydro-lH-l,7,10-triaza-cyclohepta[e]inden-6-one; 8,9-Dihydro-3H-3,6,9-triaza- 
cyclohepta[e]inden-10-one; 7,8-Dihydro-lH-l,5,8-tria2a-cyclohepta[f|inden-9-one; 8,9- 
Dihydro-5,6,9,ll-tetraaza-cyclohept[b]naphthalene-10-one; 3,4-Dihydro-[l,4]diazepino[5,6- 
b]quinolin-5-one; 8.9-Dihydro-4,8,l l-triaza-cyclohepta[a]naphthalene-7-one; 1 lH-10.1 1- 
Dia2a-benzo[b]fluorine; o-hydroxyacids; a-aminoacids; cohels; Bicyclo[2.2.2]octane; 2- 
MethyIene-2,3-dihydrobenzo[l,4]dioxine; 6,7-Dihydro-2H-pyra2ino[l,2-a]pyramidine; 9H- 
Fluorene; l,4-Diaza-bictclo[2.2.2]octane; l-Aza-bicyclo[2.2.2]octane; Pyrido[2,3- 
djpyrimidine; 5-Methylene-l,5-dihydro-pytrol-2-one; Be2no[4,5]imidazo[l,2-a]pyrimidine; 
l,4-Dihydro-ben2o[4,5]imidazo[l,2-a]pyrimidine;4,10.Dihydro-l,4a,10-triaza-phenanthren-9- 
one; 1.5-Dihydro-imidazo[l,2-a]pyrimidin-2-one; l,2,3,5-Tetrahydro-imidazo[l,2- 
a]pyrimidine; Thia2olo[3,2-a]thieno[2,3-d]pyrimidin-5-one; l,9-Dithia-4a,10-diaza- 
cyclopenta[b]£luoren-4-one; 5,6-Dihydro-l-thia-5,7,8,9a-tetraaza-cyclopenta[e]azulen-4-one; 
6,10-Dihydro-5H-l-thia-5,7,10a-triaza-benzo[e]azulen-4-one; 4,5-Dihydro-3-thia-4,5a,10- 
triaza-cyclopenta[a]fluorine; 8H-l-Thia-cyclopenta[a]indene; 3-Thia-4,5a. 1 0-triaza- 
cyclopenta[a]£luorine; 6,7.9,ll-Tetrahydro-10-thia-6,9-diaza-indeno[l,2-a]azulene-5,8-dione; 
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23,6J,12a-Hexahydropyra2ino[l\2':l,6]pyrido[3,4-b]m^ 5,10-Dihydro-4H- 
2,3a,10-triaza-cyclopenta[a]fluorine; 5H-Pyrido[4,3-b]indole; llH-lhdolizino[l,2-b]quinolin- 
9-one; l,2-Dihydro-2,4a,9,-tria2a-anthracene-3,10-dione; 6H-Isoindolo[2,Ua]indole; 1,5- 
Dihydro-benzo[b][l,4]diazepm-2-one; 5,10-Dihydro-diben2o[b,e][l,4]diazepin-ll-one; 5,11- 
5 Dihydro-ben2o[e]pyrido[3,2-b] [1 ,4]diazepin-6-one; 4,9-Dihydro-3-thia-4,9-diaza- 
ben2o[f|azulen-10-one; Benzo[g]quiiioxaline; Pyrazmo[2,3-b]qumoxaline; Pyrido[2,l - 
b]quinazolin-l 1 -one; 1 -Thia-4a,9-diaza-cyclopenta[b]naphthalene-4-one; 2-Methylene-4H- 
benzo[ 1 ,4]thiazm-3-one. 

In an exemplary CTibodiment of the invention, at least 4 of said scaffolds have exactly a 
10 single ring. Alternatively or additionally, at least 4 of said scaffolds have exactly two rings. 
Alternatively or additionally, at least 4 of said scaffolds have exactly three rings. Alternatively 
or additionally, at least 4 of said scaffolds have exactly four rings. Alternatively or 
additionally, said library includes at least 50,000 thus generated molecules. Alternatively or 
additionally, said library includes at least 100,000 thus generated molecules. 
15 In an exemplary embodiment of the invention, said scaffolds include at least three of 

said following scaffold molecules. Alternatively or additionally, said scaffolds include at least 
ten of said following scaffold molecules. 

In an exemplary embodiment of the invention, said generated molecules are 
substantially rigid. Alternatively or additionally, said molecules span a configuration space of 
20 spatial geometrical patterns of binding point types, including at least 25% of the patterns that 
exist in protein targets. Optionally, said molecules span at least 50% of the patterns. 

In an exemplary embodiment of the invention, said moleciiles span a space defining at 
least 4 distinct binding point chemistry types. 

In an exemplary embodiment of the invention, said molecules span a space defining at 
25 least 5 distinct binding point chemistry types. 

There is also provided in accordance with an exemplary embodiment of the invention, 
a screening library, comprising: 

at least 100 gauge molecules generated by attaching moieties to at least one of the 
following scaffolds: Thiophene; IH-Pyrrole; Furan; Benzene; Pyridine; Pyrimidine; Pyrazine; 
30 6H-Thieno[2,3-b]pyrrole; 1 ,6-Dihydro-pyrrolo[2,3-b]pyrrole; IH-Indole; Thieno[2,3- 
d]pyrimidine; 6,7-Dihydro-pyrazolo[l,5-a]pyrimidine; Quinoline; Isoquinoline; Quinoxaline; 
3,4-Dihydro-benzo[e][l,4]diazepin-5-one; 3,8-Dihydro-4H-pyrrolo[2,3-e][l,4]dia2epin-5-one; 
3,4.Dihydro-thieno[2,3-e] [ 1 ,4]diazq)in-5-one; 3,6-Dihydro-4H-pyrrolo[3,2-e] [ 1 ,4]dia2epin-5- 
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one; 5H,1 lH-Dibeii2o[b,f][l,5]dia2ocme-6,12-dione; l,4-Diliydro-10H-l,4,10-l,4,10-triaza- 
benzo[a]cyclopenta[e]cyclooctene-5,l 1-dione; 4H,10H-l-Thia-4,10-<iiaza- 
beiizo[a]cyclopenta[e]cyclooctene-5.1 1-dione; Dipyrrolo[l,2-c;2',l '-e]imidazol-5-one; 

1 ,4,7,9-Tetrahydro- 1 ,4,6,9-tetraaza-dicyclopenta[a,e]cyclooctene-5 , 1 0-dione; 4,7,9-Trihydro- 
l-tiiia-4,6,9-tiiaza-dicyclopenta[a,e]cyclooctene-5,10-dione; 2,4,9,Trihydro-llambda*4*,6- 
dithia-4,9-diaza-dicyclopenta[a,e]cyclooctene-5, 1 0-dione; 6,9-Dihydro-5H- 1 -thia-5,8,9,triaza- 
cyclopenta[a]azulen-4-one; 3,10,Dihydro-4H-[l,4]diazepino[5,6-b]indol-5-one; 3,6-Dihydro- 
4H-[l,4]diazepino[6,5-b]indol-5-one; 7,8-Dihydro-lH-l,7,10-triaza-cyclohepta[e]inden-6-one; 
8,9-Dihydro-3H-3,6,9-triaza-cyclohepta[e]inden-10-one; 7,8-Dihydro-lH-l,5,8-triaza- 
cyclohepta[f]inden-9-one; 8,9-I>ihydro-5,6,9,l l-tetraaza-cyclohept[b3naphthalene-10-one; 3,4- 
Dihydro-[l,4]dia2epino[5,6-b]quinolin-5-one; 8,9-Dihydio-4,8,l 1-triaza- 

cyclohepta[a]naphthalene-7-one; llH-10,ll-Diaza-benzo[b]fluorine; a-hydroxyacids; a- 
aminoacids; cohels; Bicyclo[2.2.2]octane; 2-Methylene-2,3-dihydrobenzo[l,4]dioxine; 6,7- 
Dihydro-2H-pyrazino[l,2-a]pyrainidine; 9H-Fluorene; l,4-Diaza-bictclo[2.2.2]octane; 1-Aza- 
bicyclo[2.2.2]octane; Pyrido[2,3-d]pyrimidine; 5-Methylene-l,5-dihydro-pyrrol-2-one; 
Bezno[4,5]imidazo[l,2-a]pyrimidine; l,4-Dihydro-ben2o[4,5]imidazo[l,2-a]pyrimidine; 4,10- 
Dihydro-1 ,4a, 1 0-triaza-phenanthren-9-one; l,5-Diliydro-imidazo[ 1 ,2-a]pyriniidin-2-one; 
l,2,3,5-Tetrahydro-inuda2o[l,2-a]pyrimidine; Thiazolo[3,2-a]thieno[2,3-d]pyriniidin-5-one; 

1.9- Difhia-4a,10-diaza-cyclopenta[b]fluoFen-4-one; 5,6-Dihydro-l-thia-5,7,8,9a-tetraaza- 
cyclopenta[e]azuIen-4-one; 6, 1 0-Dihydro-5H- 1 -thia-5,7, 1 0a-triaza-benzo[e]azulen-4-one; 4,5- 
Diliydro-3-thia-4,5a,10-tria2a-cyclopenta[a]fluorine; 8H-l-Thia-cyclopenta[a]indene; 3-Tbia- 
4,5a, 1 0-triaza-cyclopenta[a]fluorine; 6.7,9, 1 1 -Tetrahydro-l 0-thia-6,9-dia2a-indeno[ 1 ,2- 

a] azulene-5,8-dione; 2,3 ,6,7, 1 2a-Hexahydropyrazino[ 1 ',2' : 1 ,6]pyrido[3,4-b]indole- 1 ,4-dione; 

5.10- Dihydro-4H-2,3a,10-triaza-cyclopenta[a]fluorine; 5H-Pyrido[4,3-b]indole; 1 IH- 
Indolizino[l,2-b]quinolin-9-one; l,2-Dihydro-2,4a,9,-triaza-anthracene-3,10-dione; 6H- 
Isoindolo[2, 1 -a]indole; 1 ,5-Dihydio-ben2o[b] [ 1 ,4]dia2epin-2-one; 5, 1 0-Dihydro- 
diben2o[b,e][l,4]diazepin-l 1-one; 5,1 l-Dihydio-ben2o[e]pyrido[3,2-b][l,4]dia2epin-6-one; 
4,9-Dihydro-3-thia-4,9-dia2a-benzo[f]azulen-10-one; Beaizo[g]quinoxaline; Pyrazino[2,3- 

b] quinoxaline; Pyrido[2,l-b]quinazolin-l 1-one; l-Thia-4a,9-dia2a-cyclopenta(b]naphthalene- 
4-one; 2-Methylene-4H-benzo [ 1 ,4] thia2in-3-one. 

Optionally, said molecules are generated \ising at least one of the following scaffolds: 
Thiophene; IH-Pyrrole; Furan; Benzene; Pyridine; Pyrimidine; Pyrazine; 6H-Thieno[2,3- 
b]pyiTole; l,6-Dihydro-pyrrolo[2,3-b]pynDle; IH-Indole; Thieno[2,3-d]pyrimidine; 6,7- 
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Dihydro-pyrazolo[l,5-a]pyriimdine; Quinoline; Isoquinoline; Quinoxaline; 3,4-Dihydro- 
benzo[e][l,4]diazepm-5-one; 3,8-Dihydro-4H-pyrrolo[2,3-e][l,4]ciia2epm.5-one; 3,4-Dihydro. 
thieno[2,3-e][l,4]dia2epin-5-one; 3,6-Dihydro-4H-pyrrolo[3,2-e][l,4]dia2epin-5-one; 5H,11H- 
Dibenzo[b,fl[l,5]dia20cme-6,12-dione; l,4-DihydrO'10H-l,4,10-l,4,10-triaza- 
5 benzo[a]cyclopenta[e]cyclooctene-5,l 1-dione; 4H,10H-l-Thia-4,10-diaza- 
benzo[a]cyclopenta[e]cyclooctene-5,l 1-dione; Dipyrrolo[l,2-c;2*,r-e]iinidazol-5-one. 

In an exemplary embodiment of the invention, said at least 100 molecules comprise at 
least 300 molecules. Alternatively or additionally, said at least 100 molecules of said library 
are generated using a single one of said scaffolds. 
10 There is also provided in accordance with an exemplary embodiment of the invention, 

a screening library comprismg a set of at least 10,000 substantially rigid molecules. Optionally, 
said set comprises at least 50,000 substantially rigid molecules. Altematively or additionally, 
said set comprises at least 100,000 substantially rigid molecules. 

In an exemplary embodiment of the invention, said set is selected to have a an expected 
15 binding rate of at least 0.1% of the library for protein targets in general. Optionally, said 
expected binding rate is at least 0.5%. 

In an exemplary embodin^ent of the invention, said set is designed to provide 
molecules with a uniformity of hit probabihty for a generalized target of within a ratio of 1 :100 
for the whole set. Optionally, said ratio is witiiin 1:10. 
20 hi an exemplary embodiment of the invention, said set spans a space of spatial 

chemical configurations, each such configuration defining a certain plurality of binding points 
having distances between them, tiie set covering substantially all possible configurations in the 
space in a given range of distances. 

There is also provided in accordance with an exemplary embodiment of the invention, 
25 a screening library, comprising: 

a plurality of at least 5,000 gauge molecules, each such molecule defining at least one 
spatial configuration of binding type points, 

wherein substantially each point in a space of such configurations is covered by at least 
two different gauge molecules. Optionally, each point is covered by at least two substantially 
30 identical spatial configurations, Altematively or additionally, each point is covered by at least 
two substantially different spatial configurations. Altematively or additionally, said space is a 
space of triangles defined by binding type at vertexes and distances between vertexes. 
Optionally, said space includes distances of between 4 A and 8 A (angstrom = 10"10 meters). 
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Alternatively or additionaHy, said space includes distances of between 2 A, and 10 A. 
Alternatively or additionally, said space includes at least 5 different binding types. Optionally, 
said space includes at least 7 different binding types. 

In an exraiplary embodiment of the invention, said space includes omni-directional 
5 binding types. Altematively or additionally, said space includes directional binding types. 

In an exemplary embodiment of the invention, said substantially each point in said 
space is covered by at least three gauges. 

In an exemplary embodiment of the invention, substantially all the gauges include a 
plurality of configurations of said ^ace. 
10 There is also provided in accordance with an exemplary embodiment of the invention, 

a method of obtaining information about a bindiug behavior of a target molecule, comprising: 

providing a set of substantially rigid chemical gauges, a significant number of said 
gauges being expected to bind with said target; 

reacting said target with a plurality of gauges of said set of gauges; and 
15 physically analyzing a structure of said target bound to a gauge. Optionally, physically 

analyzing comprises analyzing using NMR. Altematively or additionally, physically analyzing 
comprises analyzing using X-ray crystallography. Altematively or additionally, physically 
analyzing comprises analyzing using binding with a set of gauges. Altematively or 
additionally, the method comprises virtually super-imposing a plurality of structures obtained 
20 by said physically analyzing. 

There is also provided in accordance with an exemplary embodiment of the invention, 
a method of constracting a lead, comprising: 

providing a set of substantially rigid chemical gauges; 

reacting said target with a plurality of gauges of said set of gauges; 
25 assaying a binding of said gauges with said target to obtain a pluraUty of assay results; 

and 

constmcting a lead based on said assay results. Optionally, constmcting a lead 
comprises linking together a plurality of gauges found to bind in said assaying. Altematively or 
additionally, constmcting a lead comprises modifying an existing molecule to have moieties 
30 that correspond to binding locations found by said assaying. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Non-limiting embodiments of the invention will be described with reference to the 
following description of exemplary embodiments, in conjunction with the figures. The figures 
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are generally not shown to scale and any measurements are only meant to be exemplary and 
not necessarily limiting. In the figures, identical stmctures, elements or parts which appear in 
more than one figure are preferably labeled with a same or similar number in all the figures in 
which they appear, in which: 
5 Fig. 1 is a schematic diagram of a target protein including a plurality of binding points; 

Fig. 2 is a flowchart of a method of dmg discovery, in accordance with an exemplary 
embodiment of the invention; 

Fig. 3 is a flowchart of a method of target measurement, in accordance with an 
exemplary embodiment of the invention; 
"•O Fig- 4A is a sdiematic illustration of an exemplary gauge, in . accordance with an 

exemplary embodiment of tiie invention; 

Fig. 4B shows the gauge of Fig. 4A, interacting vidth the target protein of Fig. 1; 

Fig. 5 is a flowchart of a method of determining which triangles did bind to a target, in 
accordance with an exemplary embodiment of the invention; 
15 Fig. 6 A is a flowchart of a method of determining a spatial layout of binding locations 

from the results of the method of Fig. 5, in accordance with an exemplary embodiment of the 
invention; and 

Fig. 6B is a flowchart of an alternative method of determining a spatial layout of 
binding locations firom the results of the method of Fig. 5, in accordance with an exemplary 
20 embodiment of the mvention. 

DETAILED DESCRIPTION OF EXEMPLARY EMBODIMENTS 

1. OVERVIEW 

2. EXEMPLARY PROCESS OF DRUG DISCOVERY 

3. DETAILS OF PROCESS 

25 3.1 TARGET MEASUREMENT 

4. EXEMPLARY ASSAYS 

4.1 FUNCTIONAL ASSAY 

4.2 BINDING ASSAY 

5. GAUGES. GENERAL 

30 5.1 EXEMPLARY GAUGE 

5.2 NUMBER OF MOIETIES IN A MEASURE 

5.3 NUMBER OF MOIETIES IN A GAUGE 

5.4 MOIETY TYPES 

5.5 OVERLAP OF MEASURES IN A SET 
35 6. RECONSTRUCTION 

6.1 TRIANGLE EXTRACTION 

6.2 LAYOUT CONFIGURATION RECONSTRUCTION 

6.3 RECONSTRUCTION VARIATIONS 

6.4 ALTERNATIVE RECONSTRUCTION METHOD 
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7.1 OVERVIEW 

7.2 RECONSTRUCTION VERIFICATION 

7.3 BINDING STRENGTH 

5 7.4 INTER-BOND INTERACTIONS 

7.5 GEOMETRIC ANALYSIS 

7.6 DETERMINATION OF STERIC CLASHES 

7.7 IDENTIFICATION OF CONTROL AREAS 

7.8 OTHER MAP ANALYSIS 

10 8. USE IN DRUG DISCOVERY PROCESSES 

8.1 OVERVIEW 

8.2 DRUG GENERATION 

8.3 LEAD GENERATION 

8.4 LEAD DESCRIPTION 
15 8.5 LEAD SEARCH 

8.6 LEAD REJECTION 

8.7 TARGETED MAPPING 

8.8 TARGET SUITABILITY TESTING 

8.9 TARGET PARTITIONING 

20 8.10 DRUG AND LEAD ANALYSIS AND ENHANCEMENT 

8.11 DRUG SELECTION 

8.12 DRUG ENHANCEMENT 

8.13 DRUG FAILURE ANALYSIS AND REENGINEERING 

8.14 ADDITIONAL DRUG DISCOVERY RELATED ANALYSIS 
25 8.15 STREAMLINE DISCOVERY PROCESS 

8.1 6 UTILITY GENERATION 

9. EXEMPLARY DISCOVERY APPLICATIONS 

9.1 OVERVIEW 

9.2 SCREENING BASED DRUG DESIGN 

30 9.3 ALTERNATIVE SCREENING BASED DRUG DESIGN 

9.4 STRUCTURE-BASED DRUG DESIGN 

9.5 MODULAR ASSEMBLY OF LIGANDS 

10. EXEMPLARY NON-DISCOVERY USES 

11. USING PRIOR INFORMATION 
35 12. ITERATIVE MEASUREMENT 

13. GAUGES, PHYSICAL PROPERTIES 

13.1 OVERVIEW 

13.2 SCAFFOLD 

13.3 VOLUMETRIC GEOMETRY OF GAUGES 
40 13.4 FLEXIBILITY 

13.5 GAUGE LENGTHS 

13.6 ENVIRONMENTAL STABILITY 

13.7 UNIQUENESS OF GAUGES AND OVERLAP OF SIDES AND TRIANGLES 

13.8 GAUGE MASS AND SIZE 

45 14. PARTICULAR AND GENERAL GAUGE SET DESIGN 

14.1 EXAMPLE SPANNING LIBRARY SIZE 

14.2 GAUGE SUBSET SELECTION 

14.3 GAUGE LIBRARY DESIGN 

14.4 LIBRARY BUILDING METHOD 

50 14.5 SCAFFOLD SELECTION METHOD 
14.6 GAUGE SELECTION METHOD 
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14.7 GAUGE SYNTHESIS 

14.8 MIXED LIBRARY DESIGN 

14.9 ENSURING LIBRARY RELIABILITY 

14.10 HUMAN INTERACTION DURING LIBRARY DESIGN 

15. EXPERIMENTS AND EXAMPLES 

15.1 EXPERIMENT 1 

15.2 EXPERIMENT 2 

16. SYNTHESIS BOOK 

16.1 Benzenes, Pyrlmidines 6-membered ring scaffold 

16.2 lndolo[2,3-b]quinoline 6,6.5.6 cyclic scaffold 

16.3 isoindoloindoles and isoindoloindolones 6.5,5.6 tetra cyclic scaffolds 
16.3.1 Isoindoloindolones 

16.4 The single atom scaffold 

16.5 Benzodiazepines 6,7 bicyclic scaffold 

16.6 Pyrazinoquinazolinone -6,6,6 tricyclic scaffold 

16.7 Pynrole -5 mennbered ring scaffold 

1 6.8 Tfiiophenes and related scaffolds 

1 6.8.1 5, 5 blcydlc scaffolds 

16.8.2 5,6-bicyclic scaffolds 

16.8.3 5,8,5 5,8,6 tricyclic and 5,5,8,6 5,5,8,6 tetracyclic scaffolds 

16.8.4 5,7 bicyclic scaffold 

16.8.5 5,6,5,6 Tetracyclic and 5,6,5 tricyclic scaffolds 

1 6.8.6 5-6-5-6 tetracyclic scaffold 

16.8.7 5-6-5 tricyclic scaffold 
1. OVERVIEW 

The high specificity of many biological molecules, such as enzymes, is created by the 
existence, in such a molecule, of a particular spatial arrangement of binding locations. It is 
believed that for a substrate molecule to succeed in usefully interacting with the enzyme, it 
must match (at least part of) the particular spatial arrangement. In the pharmaceutical industry, 
this specificity can be utilized by finding small molecules fliat mimic flie shape and chemical 
affinities of the substrate molecule. In a typical drug discovery method, such a small molecule 
is found by trying out millions of small molecules and, once finding a molecule which jqjpears 
to have some affinity, chemically fine tuning that "lead" until a better binding is found. In an 
exemplary embodiment of the invention, the particular spatial arrangement is mapped and this 
map is used to assist in the drug discovery process and, ultimately, in finding new and useful 
small molecule drugs. It should be noted that, in general, the spatial geometry of the binding 
locations is three dimensional. 

In the following description, the molecule is called a target and the spatial arrangemMit 
is called a target area or a pharmacophore. However, as will be clear, a mapping method in 
accordance with an exemplary embodiment of the invention and/or its derivatives have uses 
beyond drug discovery, for example, developing herbicides and targeted anti-bodies. Thus, the 
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tenns used are used for convenience and not for limiting the desired coverage, except where 
noted otherwise. 

Fig. 1 is a schematic diagram of a target protein 100 including a plurality of binding 
locations 102 (and 108). As shown, binding locations 102 are arranged in a target area 104, 
which is designed to accept the substrate of the protein. In some proteins, a target area of 
interest is a control area 106 of the protein (with binding locations 108), which, when bound, 
changes the behavior of the protein (e.g., changing the configuration of the substrate receiving 
area of the protein). Possibly, a plurality of non-fimctional binding locations 110 are found on 
the outside of the protein. 

Although the following description focuses on finding small molecules for affecting 
enzymatic proteins, target 100 may be any bio-molecule whose biological behavior may be 
desirably affected by the binding of a molecule to it. For example, target 100 may be one or 
more of DNA, RNA, signaling proteins such as hormones, structural hormones, growth 
factors, other proteins, anti-bodies, cell receptors, ion channels, cytokines, complexes, 
membranes, toxins (biological and synthetic), small and large molecule drugs and 
carbohydrates. Non-biological application are also envisioned, for example for assessing 
enzymes used for washing and industrial uses. In addition, the searched-for molecule need not 
be a small molecule, for some applications, for example, it may be a peptide, protein, antibody 
or metal complex. 

In accordance with some exemplary embodiments of the invention, the mapping of 
target area 104 (or 106) is provided by making multiple geometrical and/or chemical affinily 
measurements of the target area and then correlating the measurements to provide a three 
dimensional model of target area 104. In an exemplary embodiment of the invention, the 
measurements are made using a set of selective gauge molecules, hi an exemplary embodiment 
of the invention, the gauges are selective to certain bond geometries and/or certain chemical 
affinities, with an optional small range of flexibility. In a set of gauges a large range of 
geometries, sizes and/or affinities is optionally provided using a larger number of specific 
gauges. 

In an exemplary embodiment of the invention, each gauge molecule makes multiple 
measurements simultaneously and there is an overlap between the measiirements made by 
different gauge molecules. A processing step is optionally provided in which the composite 
measurement fi-om gauges are inter-related to yield an indication of individual measurements 
which are then used for reconstructing a three-dimensional map. Additional side inforaiation is 
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optionally used for the processing and/or or for analyzing and/or using the results of the 
processing. Various examples of such side information are described below. 

2. EXEMPLARY PROCESS OF DRUG DISCOVERY 

Fig. 2 is a flowchart of a method of drug discovery 200, in accordance with an 
5 exemplary embodiment of the invention. At 202, a target 100 for which a dmg is to be 
developed, is provided Optionally, at 204, a subset of gauges is selected for the measurement 
of target 100. Alternatively, a single set of gauges is used for all targets. 

At 206, the gauges are used to measure the spatial layout of interaction locations 102 
and/or 108. 

10 At 208, a model of at least part of the active and/or control areas of target 100 is 

reconstructed from the measurements. At 210 and 212 one or more molecules that match the 
measurements are determined. At 214, the matching molecules are further processed to 
provide dmgs. 

Further details of this method are described below. Altemative methods are also 
1 5 described below. 

3. DETAILS OF PROCESS 

3.1 TARGET MEASUREMENT 

Fig. 3 is a flowchart of a method of target measurement 300, in accordance with an 

exemplary embodiment of the invention. At 302, an amount of target 100 and one or more 
20 gauges are combined in a container, and possibly allowed to incubate (304) so that gauges can 
bind to interaction locations in target 100. In some embodiments of the invention, the target is 
also incubated with a substrate or another molecule. Such incubation may be provided for 
various reasons, for example, to force a conformal change on the target to assist in dissolving, 
to keep the target aUve and/or as part of a functional assay. The target may be in a relatively 
25 pure state, for example a purified replicated DNA segment. Alternatively, the target may be 
provided in a more natural environment, for example in a living cell or with associated 
molecules (e.g., whose interactive effects may be unknown). Optionally, a plurality of 
overlapping gauges (i.e., overlapping in them being able to measure same or similar spatial 
geometries) are incubated together in a same assay. 
30 At 306, the degree of binding of the gauge to target 100 is optionally determined. The 

method used may depend on the type of assay used, various examples of which are provided 
below. Alternatively or additionally, at 308, an effect on the function of target 100 is 
determined, various examples are provided below. 
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It should be noted that assays suitable for detecting binding of a test molecule to a 
target molecule are well known for drug discovery and many are suitable for the present 
invention, possibly with no modification. 

The assaying process may then be repeated with a different gauge and/or different 
conditions (310), such as solvent, temperature and pH. Varying the conditions may be used, for 
example, to determine the strength of the binding and/or to compensate for unavailable gauges, 
for example by forcing conformal changes on target 100. The repetition may depend on 
preliminary binding results for one or more gauges and/or may depend on preliminaiy 
measurements or measurement failures. 

In an exemplary embodiment of the invention, the assays are at a 1-100 micro Molar 
concentration of the gauge. However, other concentrations may be used. The concentration 
may depend, for example, on the solubility of the gauge and/or various toxic or other effects 
associated with the gauge. In many cases, the concentrations used will depend on the 
sensitivity of the assay. 

The purity of the target may or may not be important, for example, depending on the 
affinity of the gauge to the impurities and/or on the sensitivity of the assay to the impurities. 
4. EXEMPLARY ASSAYS 

4.1 FUNCTIONAL ASSAY 

Many types of functional assaying methods are known in the^. In general, the treated 
target is provided with its normal substrate (for proteins) and a measurement of enzymatic 
activity is used to determine the functional effect of the gauge, relative to a baseline or a 
control portion of material. Automated parallel assay devices, such as manufactured by Tecan 
(Switzerland), Zymark (USA) or Cybio (DE) can perform multiple functional assays in 
parallel, for example, for different gauges and/or for better statistics on a single gauge-target 
match. 

Functional assays may be on various levels, for example, on a molecular, cellular or 
organism level. In general, any known functional assay may be used to assay the functionality 
of a gauge. 

In an exemplary embodiment of the invention, the gauge acts like a ligand of the target 
and compete or otherwise affects the functionality of the target. These effects may be of 
various types, for example, the gauge may bind where the normal substrate is supposed to 
bind, the gauge may bind near where the substrate binds, but still block the substrate fi-om 
binding, the gauge may bind in a way that does not block the substrate but would, if the gauge 
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were larger (sxiitable for a binding assay) and/or the gauge may be agnostic rather than 
antagonistic in its behavior, enhancing the affinity of the target for the substrate. 

DNA targets can be assayed, for example, using replication methods (e.g., to see if 
replication is inhibited or enhanced). Alternatively, DNA targets are assayed by detemiining 
5 their interaction with DNA chips after the test binding. Such DNA chips typically include a 
substrate on which a plurality of short DNA segmwits are mounted in a known patten, with 
the segments selected to bind (e.g. be specific and complementary) to portions of a searched 
for DNA sequence and/or match sections of a non-linear DNA segment It is expected that the 
type and/or relative firequencies of bindings to various short DNA segments on a DNA chip 
10 depend on the degree and/or location of binding of a gauge to a DNA molecule. For example, a 
gauge may block a certain part of a DNA molecule firom matching up with a DNA chip 
segment. In another example, a gauge may force a conformal change in a DNA molecule, 
which change will interfere with binding with one DNA chip segment but which may allow 
binding with a previously unsuitable DNA chip segment. 

15 4.2 BINDING ASSAY 

In a binding assay, the binding of a gauge to a target is directly measured. It should be 
noted, however, tiiat a binding assay may be less indicative than a functional assay, as a gauge 
can bind at a location outside of the target area and provide no useful information about the 
target area. In addition, the sensitivity of a binding assay may be lower, since the detection 

20 sensitivity of binding is usually lower and typical binding rates are also quite low. However, in 
some cases, a functional assay cannot be patformed, for example if the gauge interacts with the 
substrate, or if a target function is not known, or may be difficult or time consuming to 
perform, for example if the assay requires a living cell. Also, a gauge may bind in an active 
area without this binding affecting the functionality, as measured by a particular functional 

25 assay. 

Various types of binding assays are known in the art and may be used, for example as 
described in the Handbook of Drug Screening, edited by Ramakrishna Seethala and 
Prabhavathi B. Femandes, in Drugs and the Pharmaceutical Sciences, Volume 114, New York, 
NY, Marcel Dekker, 2001, the disclosure of which is incorporated herein by reference. 
30 Both functional assays and binding assays may be performed in many ways, the current 

technology being robotic performance of tests and flie emerging technology being flow- 
through analysis (e.g., using DNA chips). It should be noted that 100,000 test systems are 
becoming available, which means that in some embodiments of the invention, screening using 
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a gauge library can be completed in one step (day). OptionaUy, this is used to prevent the need 
to clean out gauge delivery systems between screening targets. 

hi some embodiments of the invention, the binding assay (of a functional assay) 
includes modifying a gauge, for example, attaching a fluorescent material to the gauge. 
Depending on the attachment point, this may cause conformal changes in some of the gauges 
and/or cause steric clashes. It is expected that the overlap between gauges will overcome fliis 
problem, at least in a significant number of cases. 

In other embodiments of the invention, the gauges are not changed, or are changed in 
non-material ways. For example, for an NMR binding assay or an x-ray crystallography 
binding assay no change is required. In a radioactivity based assay, radioactive isotopes can be 
used in the gauges, hi an exen^lary embodiment of the invention, non-iadioactive isotopes 
(half spin isotopes) are used in producing the gauges, to provide binding detection and/or 
better analj^is of NMR data. In these assays, unbound gauges may be sq)arated j&x>m the 
targets, for example, using methods known in the art, for example, if the target is bound to a 
surface, washing will remove unbound gauges. 

In some embodiments of the invention, the binding of the gauge has a non-functional 
efifect on the target, which may be detected or measured, for example, affecting a vibration 
frequency of a fluorescent tail attached to the gauge or the target. In an exemplary embodiment 
of the invention, the gauge binds with the target in a manner similar to that of a ligand of the 
target. Various techniques, for example as known m the art (e.g., NMR, IR) may be used to 
analyze the combined target/gauge structure. Optionally, once a binding gauge or other 
substrate is found, a gauge set is used to measure the combmed target/ligand structure. 

In some binding assays, a plurality of differently marked gauges may be assayed 
simultaneously and possibly differentially, for example, by a attaching a different florescent 
marker to different gauge used together and/or using different radioactive isotopes for different 
gauges. 

Optionally, the binding assay (and/or a functional assay) may include changing various 
environmental parameters, such as temperature, pH and/or other environmental variables, for 
example to determine a strength of binding. 

In an exemplary embodiment of the invention, a binding assay is used to determine a 
baseline level of binding of the gauge outside active areas of the target. In one example, the 
degree of binding of a particular gauge to alpha helixes in a protein may be known from an 
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analog of the target. The total binding to the target, however, includes bindings to non-helix 
parts of the protein and/or target areas of the target. 

In an exemplary embodiment of the invention, it is noted that a large number of hits are 
expected and/or an overlap betsvcOT gauges is provided. As a result, lower quality and/or faster 
assays are used, since noise caused by low binding rates may be less of a problem. In one 
example, borderline results form two assays are combined, based on a repetition of triangular 
measures between the gauges used in the assays. 
5. GAUGES, GENERAL 

5.1 EXEMPLARY GAUGE 

Fig. 4A is a schematic illustration of an exemplary gauge 400, in accordance with an 
embodiment of the invention. 

Gauge 400 comprises a scaffold 402 and four chemical moieties 406, 410, 414 and 422 
attached to scaffold 402 via bonds 404, 408, 412 and 420, respectively. This is only an 
exemplary gauge, as the properties of all of these elements may vary, for example as described 
below. In particular, one or more of the type of moiety, number of moieties, type of bond, 
distance between moiety and scaffold, type of scaffold and location of connection to the 
scaffold may be varied for different gauges, sets of gauges and/or embodiments of the 
invention. 

In an exemplary embodiment of the invention, a plurality of moieties cooperate to 
define a measure. In an exemplary embodiment of the invention, the gauge purpose is to detect 
interaction locations that bind to those moieties that define a measure at the distances between 
the moieties. The matching of a measure to the target molecule may be indicated by the 
binding of the gauge. In an exemplary embodiment of the invention, a basic unit of measure is 
a triangle (or other geometric shape) defined by a subset of all the moieties. As will be 
described below, the shape of a triangle has particular properties which make it suitable for 
some embodiments. In general, if a gauge includes more than the number of moieties in a 
measure (e.g., more than two moieties for a linear measure, more than three for a triangle), 
more than one measure may be provided by a single gauge. Thus, in the exemplary 
embodiment of the invention shown, a plurality of different triangle measures are defined in a 
single gauge. In some embodiments and in some cases, a gauge will include only one measure, 
for example, gauge 400 includes only a single four-point measure, but four triangle measures. 
Exemplary methods of deteraiining which of various possible measures actually boimd, are 
described below. 
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One triangle geometry is shown by dashed lines 416, 418 and 420 that define the 

distances between pairs of moieties of gauge 400. As noted above, in an exemplary 

embodiment of the invention, ttie gauge purpose is to detect interaction locations that bind to 

those moieties (406, 410, 414) at the distances defined by dashed lines 416, 418 and 420 (e.g., 

5 triangle sides). Assuming gauge 400 included only moieties 406, 410 and 414, then a binding 

of gauge 400 to target 100 can be is used as an indication that three interaction sites, of a type 

suitable to bind to moieties 406, 410 and 414 are approximately at the respective distances 

defined by gauge 400. Since gauge 400 defines multiple triangles, a binding of gauge 400 

indicates that at least one of the triangles defined by the moieties, binds. 

10 Fig. 4B shows gauge 400, interacting with target 100, at three interaction locations 450, 

452 and 454. Non-interacting moieties and the rest of the gauge are not shown. 

5.2 NUMBER OF MOIETIES IN A MEASURE 

As noted, each plurality of moieties defines a measures. While the present invention 
accommodates, in some embodiments thereof measures, with two, fliree, four and/or other 
15 numbers of moieties and/or gauge sets including a mixture of different measures, in an 
exemplary embodiment of the invention, the basic measure used is a triangle, with three 
moieties. Using a triangle may provide one or more of the following potential benefits: 

(a) A triangle defines a stable spatial relationship, which may be usefiil as a unit 
component when "constructing*' a model in three dimensions of the target area, firom the 

20 binding results. 

(b) There are fewer possible triangles than four-sided measures (for example). Thus, 
generating a library that includes measures that cover an entire space is less time consuming. 
Fiulher, as it is desirable in some embodiments of the invention to provide overlap between 
measures, such overlapping measures are more easily provided if there are fewer measures. It 

25 is possible that chemical limitations may prevent tiie construction of high-order measure gauge 
Ubraries. 

(c) A triangle always Ues in a plane (e.g., three points define a plane), which may be 
mathematically usefiil for some reconstmction mefiiods. 

(d) For some applications, a triangle represents the lowest number of binding points 
30 that will resuU in a measurable binding to a target active area. A typical drug includes six or 

more binding points, often as many as ten or more. Conversely, a higher-order measure may 
bind too strongly. In other appUcations, the optimal mmiber of moieties in a measure may be 
higher or lower, of course. 
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Alternatively, a measure including two moieties are used, for example, defining Unes. 
Alternatively or additionally, four- or higher valance measures are used, for example, to define 
more uniquely an interaction location configuration. In some embodiments of the invention, a 
mix of different valance measures may be used, in the gauge set and/or in the reconstruction, 
for example, 2-point, 3-point, 4-point and 5-point measures, which may or may not be planar. 

5.3 NUMBER OF MOIETIES IN A GAUGE 

In an exemplary embodiment of the invention, the number of moieties in a gauge is 
between four and ten, however, a smaller (e.g., three) or greater number may be provided. 
Some scaffolds may be limited in the number of dififerent moieties, moiety positions and/or 
moieties combinations possible. Larg^ numbers of moieties are generally desirable if the 
moieties define dififerent triangle measures. Conversely, gauges with multiple attachment 
points and/or gauges with many moieties may be more prone to steric clashes and/or other 
adverse interactions between tiie moieties, which inhibit binding. 

While the scaffold itself has chemical properties and may be considered as having 
moieties, in some embodiments of the invention, these properties are ignored, for example 
during Ubrary design and/or during binding results analysis. Alternatively, the properties of the 
scaffold may be considered, for example only during analysis and/or during Ubrary design. 

5.4 MOIETY TYPES 

In an exemplary embodiment of the invention, the moieties are selected to reflect the 
types of bonds that the drug is expected to make with the target. In an exemplary embodiment 
of the invention, the moieties are selected based on their chemical behavior. If a particular 
behavior is exhibited by several moieties, in an exemplary embodiment of the invention, only a 
smallest one of the moieties is selected. In some embodiments of the invention, multi-purpose 
moieties, which can bind to several different binding sites, are used instead of moieties which 
can only bind to one type of target site. The specificity of the moieties selected may depend, 
for example, on the total number of moieties, their size and their amenability for chemical 
processing. It should be noted that some of the moieties are dh-ectional, while others are non- 
directional. Where available, non-directional bonds may be preferred over directional bonds. In 
some exemplary embodiments of the invention, two levels of measurement are performed, a 
coarse resolution level and a fine resolution level. More specific moieties may be used during 
the fine resolution level of measurement. Additional details and methods for optionally 
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reducing the number of moieties used in some embodiments of the invention, are described 
below. 

Following is a list of moieties of which one or more may be attached to gauges: 

a. Hydrogen bond donor. Directional bond. 

b. Hydrogen bond acceptor. Directional bond 

c. Positive charge. Non-directional bond. 

d. Negative charge. Non-directional bond. 

e. Aromatic ring. Directional bond. 

f. Hydrophobic group. Non-directional in general, however, some, e.g., rings, may be 
directional with a preferred direction perpendicular to the ring plane. 

Different moieties may be used in other embodiments of the invention, for example, also 
providing one or more of Halogen, Carbonyl, Phosphate and Sulfate bonds. It should be noted 
that the different moieties may differ greatly in the their chemical afBnities or fliey may differ 
less or even slightly. In some exemplary gauge sets, the slight difference between moiety 
afBnities is used to fine tune a measurement distinction between bond types. 

With respect to the directional bonds, in some embodiments of the invention, it is 
assumed that the bond has sufficient spatial flexibility so that a small number, e.g., seven, 
different directional bonds will suffice to cover all the possible bond directions. Alternatively, 
smaller or greater numbers of bond directions may be used. OptionaUy, different directional 
bonds have different numbers of directions represented in a gauge library. The angular 
distribution of the directions may be, for example uniform, or it may be non-uniform, for 
example depending on the bond type. 

Several different sizes of hydrophobic bonds may exist In an exemplary embodiment 
of the invention, two sizes are selected and represented by different moieties. An aromatic ring 
may also serve, as an oversized hydrophobic moiety. Alternatively or additionally, an aromatic 
ring is used to match aromatic bonds with other rings and/or some types of hydrogen bonds. 

The above selection of moieties and directions results in 25 unique moieties, which can 
be attached to scaffolds. An exemplary set of moieties is described below. 

In an exemplary embodiment of the invention, a subset of the above moieties is used. 
Use is made of the rotational flexibiUty of hydrogen bond donors and/or receivers. Although 
such flexibiUty wiU generally reduce chemical bonding probability, the mass of a hydrogen 
atom used in a hydrogen bond moiety is sufBciently low that the reduction in probability may 
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not materially affecTOe results of the measurement method, a^east for some gauges and 
assays. 

Alternatively or additionally, rotational flexibility is allowed for aromatic rings. 
Although aromatic rings have a high mass, the large bond area of the ring compensates for the 
5 reduction in bond strength caused by allowing rotational flexibility of the ring. 

Alternatively or additionally, some polar bonds may be represented by a single moiety, 
such as OH, which can act as both a hydrogen bond donor and as an acceptor. 

Optionally, for example if chemical information can be done without, more general 
moieties are used and a smaller number of triangles in a Ubrary is spanning. 

10 5.5 OVERLAP OF MEASURES IN A SET 

In an exemplary embodiment of the invention, the triangle space as a whole is spaimed 
by providing a plurality of triangles, each with sufficient freedom in its parameters (e.g., bond 
length, chemical affinity), so that each triangular arrangement of binding points can be 
expected to bind to one of the triangles to a measvurable degree. Optionally, the coverage of 

15 each triangle in the triangle space overlaps with the coverage of other triangles, to ensure that 
no parts of the space are left uncovered. 

As will be explained in greater detail below, in an exemplary embodiment of the 
invention, a gauge library is designed such that each possible triangular arrangement of 
binding points appears in (or fits within the parameters of) more than one gauge. In some 

20 cases, exactly congruent triangles cannot be provided, instead, triangles that are roughly 
congruent are provided (e.g., similar moieties, side lengths). These congruent triangles may 
have the same coverage in triangle space or not. For example, assiuning same moieties, two 
triangles with the following side lengths are provided: (3, 4, 5) and (3.1, 3.9, 5.2) 
(measurements in angstrom. These triangles may, for example, cover the part of triangle space 

25 from (2, 3, 4) to (4, 5, 6). 

In some embodiments of the invention, at least some of the triangle space is spanned by 
a set of triangles with overlapping coverage. For example, for the same part of triangle space, 
the provided triangles are (2, 3, 4.5) and (2.5, 3.5, 5.3), which have overlapping, but different 
coverage. 

30 While overlapping is useful for various reasons, for example, as described below, it 

does increase the size of the library. When overlapping is provided, the reconstmction method 
used optionally takes the overlapping into account. 
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6. RECONSTRUCTION 

AftCT process 300 (Fig. 3) is repeated for as many gauges as desired, the measured 
affinities of gauges 400 to target 100 are optionally used to reconstruct a model of the spatial 
distribution of interaction areas 102. An exemplary method is described below. 
5 In an exemplary (theoretical) mapping process for a particular target molecule, which 

uses a 75,000 gauge library, it is expected that about 400 of the gauges will bind to the target. 
Due to repetition of triangles in the library and/or due to the overlap in coverage of non- 
congruent triangles in the exemplary library, the number of real triangles defined by the target 
area and bound to by gauges is expected to be smaller. Ih one (theoretical) example, the 
10 number of "real" triangles that are defined by the target area and bound to by gauges is 100 
different triangles. 

Taking for example a 10-point pharmacophore, such a pharmacophore may include, for 
example, 10*9*8/6 triangles, which is 120 triangles. In some embodiments of the invention, 
not all of these triangles are identified, for example, due to high similarity between triangles 
15 (below distinguishing ability) or due to lack of binding (e.g., due to steric clashes). The 10 
point structure can, of course be reconstructed with fewer than 100% of the triangles, 
especially of the missing triangles are missing randomly. For example, 50% of the triangles 
maybe sufficient. 

However, the actual situation is more forgiving. A typical pharmacophore may include 
20 20 points, of which, typically only between 8 and 10 need to be identified in order to provide 
good binding. Thus, any substmcture of the pharmacophore that includes 8-10 correct points 
can serve as a good starting point for dmg generation. Fewer identified points can also be 
usefiil, for example as described below. 

Although various methods may be used to reconstmct the layout, in an exemplary 
25 embodiment of the invention a two step method is used. First, the "real" triangles are estimated 
fi-om the results of the assay, optionally using a clustering algorithm. Then, a suitable layout 
using the triangles is found, optionally using a scoring based search algorithm or a clustering 
algorithm. In other implementations, a single step or multiple step method may be used. 

6.1 TRIANGLE EXTRACTION 

30 In an exemplary embodiment of the invention, this step of the process has two parts, 

however, in other implementations, this step has a single part or more than two parts. One part 

is detemiining which triangle measures matched. This part may be less than trivial, for 

example, due to the fact that each gauge includes multiple triangles. However, the repetition of 
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triangles between gauges may assist in differentiation. Another, optional, part of the process is 
determining the real distances involved, rather than those defined by a measure. For example, a 
real distance between two moieties may be 4.3 angstrom, while binding triangle measures have 
distances of 4 and S angstrom. In some embodiments of the invention, it is desirable to 
5 estimate the real distance, 4.3 angstrom, fix>m the binding results. Optionally, this is provided 
by the overlap in coverage of the different triangle measures. 

In an exemplary embodiment of the invention, the two parts of the process are provided 
in a single compound process, for example using clustering. Alternatively a two step method 
may be used. Optionally, an iterative method is used with an estimate of which measures 
10 bound being used to estimate real distances and the real distances being used to improve the 
earlier estimate of which measures bound. 

Fig. 5 is a flowchart of a method 500 of determining which triangles did bind to a 
target, in accordance with an exemplary embodiment of the invention. 

At 502, a space is defined for each type of triangle (defined by the moieties of the 
15 triangle). Each such space has three dimensions, each one representing a length of a side of tiie 
triangle. 

At 504, a notation is made in a space at a location {x,y,z} if a gauge including that type 
of triangle with sides of lengths {x,y,z} was shown to bind to the target. It should be noted that 
for two different scaffolds, exactly matching triangles may be difficult to generate. Listead, the 

20 triangles may be nearly matching, for example having slightly diff^ent lengths of sides. 

In an exemplary embodiment of the invention, the assay results are used as a binary 
input, there is either a bond or not. Altematively, for example if conformal changes are 
observed or there is a measure of activity and/or bonding, die bond strength may be 
represented by a continuous or multi-step amplitude, using a hit notation. 

25 In an exemplary embodiment of the invention, if a single gauge includes multiple 

triangles, a hit is marked in each one of the relevant spaces. Altematively or additionally, if a 
single triangle can match two different type triangles, for example due to overlap between 
moiety affinity, it is also marked in multiple spaces. Optionally, the amplitude of the marking 
is normalized to the number of spaces tiiat are marked by the gauge. Altematively or 

30 additionally, a different amplitude is provided in each space, responsive to an a priori 
probability of bonding. 

At 506, the hits notations are replaced by a spatial spread Amotion. In an exemplary 
embodiment of the invention, the spread fimction represents the probability of that triangle 
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forming a bond at different distances represented by the spread. Alternatively or additionally, 
the spreading is between spaces, for example, if two moieties overly in their afBnities. 

Alternatively, flie hit indication is provided originally as a spreading function. 

In an exemplary embodiment of the invention, the spreading is a defined as 

5 f^e^ 

where Ax is the difference between the lengths of the sides and ox is a value representing the 
difficulty in bending the molecule so that it can perform the bond. In an exemplary 
embodiment of the invention, ox is a function of x, for example ax = a-fx . In an exemplary 
application, parameter "a" is 1.414. Possibly, the spread function is non-uniform in space, for 

10 example, to reflect non-uniform characteristics of the bond. Optionally, at least some of the 
spreading functions are derived empirically, by binding gauges having controlled distances 
between bonds, with targets having known models. Alternatively or additionally, such 
empirical testing is used for other purposes, for example, to determine flexibility in bond 
length, multiple chemical affinity of moieties and/or symmetry of the spreading function. 

15 Optionally, targets are classified according to their flexibility as well. Optionally, in an 
iterative process, once a model is estimated, a flexibility of the target is estimated and/or 
decided, for example foma a table, and used to correct the spreading function used. 

The spread hits are then combined, for example by addition, and peaks are found in the 
result (508). In an exemplary embodiment of the invention, peaks are selected based on their 

20 shape. Altematively or additionally, peaks are selected based on their amplitude passing a 
threshold. This threshold can represent, for example, the number of triangles that need to bind, 
to indicate a possible match. The threshold may be the same for all spaces or it may be 
different. Optionally, the threshold and/or decision making method is selected based on the 
clustering statistics, for example fix>m a table of previous empirical results. Altematively or 

25 additionally, the threshold is selected so that a minimum number of matches be found. 
Optionally, if there is a large number of sub-threshold matches, a different gauge set is used for 
the binding process. It is noted that in some embodiments of the invention, for any given triplet 
of binding points there are generally about 12 triangles, or more, that can be expected to bind. 
For example, both a shorter side and a longer side are expected to bond to a pair of binding 

30 locations having an intermediate distance between them. In addition, each triangle type can 
appear multiple times, for example, three times in the set. Li some sets, each (or some) triangle 
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point in the triangle space is covered by 24 triangles - 8 triangle designs that have longer and 
shorter sides in various combinations, times 3, if each triangle is provided three times. 
Additional overlap may be provided by ambiguous moieties. 

Optionally, by analyzing correlation between spaces and gauges, some four-point 
geometrical matching (or higher) may be found as well. 

6.2 LAYOUT CONFIGURATION RECONSTRUCTION 

Fig. 6A is a flowchart of a method 600 of determining a spatial layout of binding 

locations from the results of the method of Fig. 5, in accordance with an exemplary 
embodiment of the invention. In an exemplary embodiment of the invention, the method 
comprises constructing all the configurations (e.g., three dimensional shapes) that can be 
constructed from the identified triangles and ranking the configurations using a scoring 
method, ultimately selecting the configuration with a highest score. 

At 602, all the possible configurations that can be constructed from the triangles foimd 
in Fig. 5, are constructed. Altematively to building computer models of all the possible 
configurations, in an exemplary embodiment of the invention, the configurations are generated 
ad hoc. For example, in conjunction with the scoring method described below, a configuration 
may be constructed, or its construction advanced, only if it is likely to have a usefiil score. For 
exan^le, once a configuration solution has a score below the highest foimd so far, that lower 
solution is ignored. 

hi an exemplary embodiment of the invention, the construction method is by building 
up a structure piece by piece. For example, a triangle is added to an existing configuration only 
if has a side length and/or moieties that match a side length with a pair of moieties on the 
structure. A threshold of size difference may be defined for allowing the matching of two 
sides. Altematively or additionally, a threshold of matching between moieties may be defined. 
Optionally, the moieties are required to match at the ends of the matching side, or to have an 
overlapping chemical behavior. Such thresholds may depend on the length and/or types of 
moieties and/or other properties of the gauges and/or the target. It is noted fliat a first gauge 
may bind to a particular binding location using a different binding method from a second 
gauge, as long as the binding location supports both binding methods. 

In an exemplary embodiment of the invention, the construction of a configuration is by 
sequentially selecting a triangle from the list of available (bond) triangles, until all the triangles 
are used at least once. Used triangles may remain in the list for repeated use. Altematively, the 
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configuration may be built up using modules, each of which is constructed &om sub-modules, 
and, ultimately, triangles. 

At 604, a score is calculated for each configuration. Such a score is optionally a 
heuristic value indicating the reasonableness of the assay results being derived fi-om the target 
having the configuration. Various scoring methods may be used. In an exemplary embodiment 
of the invention, the scoring method is based on the particular linking together of triangles in 
the configuration and/or on the probability of the triangles themselves being correct in the first 
place. 

In an exemplary embodiment of the invention, the score is a product of scores for each 
shared triangle side. In an exemplary embodiment of the invention, the score for a triangle side 
that is shared between two triangles is an estimated probability of the two sides of the two 
triangles binding to a same pair of binding locations. Jn an exemplary embodiment of the 
invention, the score is the product of the above spreading fimction, for the x, y and z axes. 
Alternatively or additionally, other, simpler scores, may be used, for example, based only on 
the difference in sizes of the sides. 

In an exemplary embodiment of the invention, the score does not depend on the lack of 
a triangle. For example, if a generated configuration includes a three point configuration for 
which no suitable gauge matched, it is not assumed that the configuration is incorrect, nor is 
the score reduced. Altematively, the score may be reduced responsive to the existence of 
triangles that are found in a configuration and not found on any matching gauge, for example, 
based on their coimt. 

Altematively or additionally, some configurations may be raled out based on heuristics, 
for example mles that describe what the layout typically looks like. Altematively or 
additionally, prior information is used to rale out some configurations, for example, a partial 
model or knowledge of a molecule that binds well to the target. 

At 606, the stracture with the highest score is selected as the map layout of the binding 
locations for the target. As noted above, 602-606 may be carried out as an iterative search and 
construction method, for example with stractures being built ad hoc as the search progresses 
and indicates a certain structure has a score above a threshold (and so will its dependents). 
Many suitable search methods are known in the art, for example, in the art of graph search and 
in the art of searching game trees (e.g., for chess playing programs). 
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6.3 RECONSTRUCTtON VARIATIONS 

In an exemplaiy embodiment of the invention, a target may have several active areas. 

hi an exemplary embodiment of the invention, the reconstruction is allowed to recreate a 
disjoint configuration structure, with each disjoint part representing a map of one target area. 
5 Optionally, such a reconstruction may be required even for a single active area, if enough 
triangles (e.g. gauge moieties) that interconnect the disjoint parts failed to bind (for various 
reasons) and/or were not available in the gauge set used, so that a continuous structure cannot 
be reconstructed from the triangles that did match. 

Optionally, the above reconstruction allows a triangle to appear only once in a 

10 reconstructed configuration. Even if a triangle actually appears twice (or more) in the real 
configuration, the redundancy of similar triangles will generally still enable the structure to be 
reconstmcted. Alternatively or additionally, a triangle is allowed to appear more flian once, 
however, this may affect the score, for example, reducing it. Alternatively, an iterative 
experimental approach, as described below, is used, to block part of the target (e.g., with a 

1 5 suitable antibody or small molecule drug) and see if the triangle still matches. 

Optionally, user intervention is allowed, for example, for viewing the final structure or 
several candidate structures. For example, if a determination cannot be made, a human may be 
requested to select among options, force certain matches and/or configuration parts and/or to 
remove certain possibilities from consideration, based on, for example human experience and 

20 judgment and/or additional infomiation about the target of various types. 

It should be noted that one possibly output of the clustering and/or shsqpe reconstruction 
methods is an input to an interactive process and/or to fiirther drag development. For example, 
the application of the above methods can show where more exact data is lacking for forming a 
complete result and/or where there are ambiguities between possible solutions. 

25 It should be noted that tiie resulting structure may have a mirror (e.g., symmetry) 

ambiguity, due to the sole use of triangles. Optionally, this ambiguity is solved by using at 
least one 4- or higher- point measure, optionally constmcted or selected to bind in only one of 
the possibilities. Alternatively or additionally, the effect of steric clashes is used to distinguish 
between the two possibiUties. Alternatively or additionally, prior information is used to 

30 distinguish between them. 

6.4 ALTERNATIVE RECONSTRUCTION METHOD 

Fig. 6B is a flowchart 620 of an alternative reconstruction method, using clustering for 

shape reconstruction, in accordance with an exemplary embodiment of the invention. 
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At 622, a triangle is selected from the set of found triangles, that were found to bind in 
the assay and clustering of Fig. 5. This triangle is used as abase for constructing a structure. 

At 624, a pair of triangles is selected from the remaining foimd triangles, such fliat the 
two triangles share a side with each other and each triangle also shares a side with a part of the 
5 structure (e.g., which two sides of the structure may or may not be sides of a same triangle, 
depending, for example on the implementation). When the triangle pair is added to the 
structure the structure grows by one point in space. 

624 is repeated (626) until no triangle pairs can be added. This completes one potential 
structure. 

10 Often, there are several possible choices to make at 624, for example, for selecting the 

triangle pair and/or for deciding where to add them. At 628, a tree of possible stmctures is 
performed, by repeating 624 and 626 for each possible choice of triangle pairs and their 
location. This process may be done, a priori, for example, by spawning multiple flireads each 
time multiple triangle pairs are available for selection and/or each time such pairs may be 

1 5 attached at different locations. 

At 630, 622-628 are repeated by selecting all possible triangles as bases, in tum (or in 
parallel). Alternatively, other methods of generating all the possible structures from the 
triangles may be used. Optionally, a pruning method is used, for example, if a structure is 
clearly unsuitable or unable to utilize a significant percentage of the triangles (e.g., 30%, 50%, 

20 70% or any smaller, intermediate or greater value), the structure is dropped. Generally, the 
greater the number of triangles allowed to be ignored, the easier it will be to provide a 
structure (e.g., even under noisy conditions) . However, fee structure will be less constrained 
by the assay results and may be less dependable. 

At 632, all substructures found in the generated potential structures. Optionally, only 

25 some of the substructures are foxmd, for example, only the largest or only those above a certain 
size. In an exemplary embodiment of the invention, the method applied is a maximum 
likelihood algorithm for finding a most likely structure. 

At 634, these substructures are clustered, with each point representing a stracture in 
which the substructure is found. In an exemplary embodiment of tiie invention, the clustering 

30 space is defined per triangle type (e.g., type of moieties on the triangle) and the space is 
spanned by the sides of the triangles. Thus, for example, a 10 point sub-structure of a 20 point 
structure is marked in a space that includes the same number of moiety types as the sub- 
structure, with a location in that space determined by the three Cartesian locations of each of 
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the points (e.g., 30 dimensions for a 10 point sub-structure). Various orientations are 
optionally dealt with by selecting a obtain triangle to be a base triangle having an orientation. 
Alternatively or additionally, the space is marked with structures in a rotationally symmetric 
manner (or thus analyzed) so that the results from different orientations may be compared. An 
5 exemplary algorithm is described in R. Nussinov, H.J. Wolfson, •'Efficient Detection ofThree 
Dimensional Structural Motifs in Biological Macromolecules by Computer Vision 
Techniques", PNAS, volume 88, pp. 10495-10499, December 1991, the disclosure of which is 
incorporated herein by reference. 

At 636, a best substructure is selected. It is assimied that if a substructure is common 
10 enough and large enougih it is both correct and useful. In an exCTiplary embodiment of the 
invention, a thresholding is applied to select only tiiose substructures with structures and 
clusters over a minimum size. Other selection methods may be used as well, for example 
scoring, for example based on accmnulated score of matching up pairs of triangles (this 
matching up may be thresholded during constructions, for example using a preset threshold). 
1 5 Alternatively, other methods of findmg a large common substructure are used. 

It should be noted that while the clustering method may generate a structure that does 
not use all the triangles and is not complete, a complete map of the pharmacophore is not 
essential for many embodiments of the invention, for example for lead generation and finding. 
7. ANALYSIS 

20 7.1 OVERVIEW 

The above process of measuring and reconstructing a target area can be used to provide 
a wide range of information. The quality of the information and its type can be of varying 
kinds. Following are exemplary types of parameters which may be used to classify such 
information: 

25 (a) Completeness. The information may be complete or partial, for example, a complete 

target area model or a model of only part of an area. 

(b) Factual or statistical. An example of factual information is an exact model. An 
example of statistical information is a set of relative probabilities for a set of possible models. 

(c) Independence. Infomiation may be independent of other information, for example, 
30 being an exact model or it may be dependent, for example a parameteric model whose exact 

value depends on additional information. In addition, information derived using the above 
methods may be used as partial information for a different process. 
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(d) Substantiation. The infonnation may be supported by oth^ infonnation or it may 
stand on its own or even be in conflict with other infomiation. 

(e) Positiveness. The information may be positive, in that it indicates what exists if is 
desirable, or negative, in that it can be used primarily to knock out certain possibilities. 

5 While the infomiation garnered may be about the binding locations, in some case, the 

infomiation is regarding the geometry of the target at non binding locations as well. As will be 
described below, for example, a geometrical structure can also affect the usefulness of a drug 
lead. 

In some embodiments of the invention the analysis is used to acquire information about 
10 the gauges themselves, for example, their relative binding afiSnity, and/or their chemical 
behavior (e.g., pH dependencies). Such information may be general or it may be for groups of 
targets, for example, different for different families of proteins and the same within a femily. 

As can be appreciated, such a widely varying, range of information is amenable to many 
methods of analysis, some of which are described below and to many applications, some of 
15 which are also described below. In particular, some exemplary analysis methods are directed to 
gamering further information about the target area and for error detection and analysis and 
some exemplary applications are integrated as part of a drug discovery process. 

In some case, the results of the analysis are integrated into the reconstmction as 
geometrical and/or chemical informatioa Alternatively or additionally, the infonnation is 
20 associated with the reconstruction and/or the target, for example, in a manner similar to that 
used for dmg leads. This manner generally depends on the type of database used for storing 
information. 

7.2 RECONSTRUCTION VERIFICATION 

Tn an exemplary embodiment of the invention, the error size and/or type of the layout is 
25 determined. In one example, the reconstructed layout is analyzed to generate theoretical 
binding values for the gauge set used. Differences between these theoretical binding values and 
actual binding values may be used to indicate parts of the layout which are not exact and/or to 
indicate a degree of inaccuracy of the layout and/or the reconstruction process as a whole. 

Alternatively or additionally, physical verification is applied, for example, by applying 
30 an additional testing method and^or assay library to select between altematives or for 
verification. 
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7.3 BINDING STRENGTH 

In an exemplaiy embodiment of the invention, the generated layout is analyzed to 
estimate the relative binding strengfli of binding points in the target area. In an exemplary 
embodiment of the invention, the reconstructed layout is modeled and theoretical binding 
5 values for the gauge set are calculated. Variation in the actual binding values may be partly 
caused by a reduced or increased afiGnity of target area. Such estimation is generally statistical 
in nature since there are many variables that affect binding probability. However, it is expected 
that if a bond length and type are known and the exact positioning of the gauge in the target 
area can be determined (e.g., and its energetic consequences), than at least a statistical analysis 
10 of binding strength may be provided. Optionally, a baseline is provided by analyzing 
molecules with known behaviors, or by comparing flie binding of different, but similar gauge- 
triangles. 

7.4 INTER'BOND INTERACTIONS 

In an exemplary embodiment of the invention, the analysis is used to determine an 
15 interaction between the binding of different binding points. For example, such an analysis can 
compare the contribution of a binding point to the binding of a certain gauge, as compared to 
what is expected (e.g., based on energy and other calculations) and/or as compared to the 
apparent contribution of that binding point to the binding of a different gauge. This may 
indicate, for example, the effect of the bonding to one interaction location on the afOnity of a 
20 neighboring interaction location. Optionally such interactions are estimated and/or modeled 
using a model of electronic charge distribution in the target. 

7.5 GEOMETRIC ANALYSIS 

For some purposes, and to some degree of accuracy, the determined layout can be 
considered to be a cast of the target area. In an exemplary embodiment of the invention, the 

25 geometry of the target area is analyzed. Additional information may be provided by 
determining which gauges did not buid or bound with a lower affinity (which, if the binding 
geometry was similar is assumed, in some embodiments of the invention, to be due to steric 
clashes). This may assist in further defining the geometry of the target area. It should be noted 
that some steric clashes can be predicted firom the geometry of the layout. Any failed binding 

30 which has no other apparent reason and should have matched the determined geometry, may be 
assumed to result firom a projection of matter that does not define a noticeable binding point. 
This is described in more detail below. 
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In an exemplaiy embodiment of the invention^ the geometric analysis is used to 
determine a size of entry hole into area 104 (e.g., where am)w 400 is shown in Fig. 4B). A 
small hole and/or certain moieties at the hole entrance may rule out the possibility of certain 
drug sizes and/or types. Altematively or additionally, the geometrical analysis is used for 
5 classifying the target, for example, based on the size of substrate that it might work on. In an 
exemplary embodiment of the invention, geometrical analysis (e.g., for substrate 
determination) is supported by chemical analysis of the moieties in target area 104. 
Determination of the geometry may also be useful in deciding what marking methods of small 
molecules and/or gauges may work (e.g., not to use large florescent markers, if the entry hole 
10 is small). 

It should be appreciated that in some cases it may be easier to reconstruct the geometry 
of a target area, rather than its chemical binding pattern or vice versa. 

7.6 DETERMINATION OF STERIC CLASHES 

In an exemplary embodiment of the invention, steric clashes are detected in the analysis 

15 process and/or used to provide additional geometric and/or chemical information about the 
target. Li an exemplary embodiment of the invention, st^c clashes during the binding process 
are determined by comparing the affinities of different gauges with same triangles. This 
comparison optionally takes into accoimt one or more of entry hole size, chemical behavior of 
the gauge, degree of matching to the binding geometry and/or other binding locations, Steric 

20 clashes are, for example, caused when the proximity or potential overlap of the gauge and the 
target molecule reduce the binding affinity. 

As the shape of the gauges is known and, in some embodiments of the invention 
relatively rigid, steric clashes may be expected to result from the non-participating moieties of 
the gauge and/or the scaffold itself. 

25 In an exemplary embodiment of the invention, the steric clashes are used to generate a 

map of locations near the target that interfere with gauge atoms, thus possibly indicating 
occupied (e.g., by atoms, electric fields) parts of the target, which do not, apparently caxise a 
binding interaction with any gauge, to a noticeable degree. 

In an exemplary embodiment of the invention, the map is used to provide further 

30 information about the shape of the active area m target 100. Altematively or additionally, the 
map is used for assisting in drug development, for example, by filtering out potential drugs that 
would have the same steric clashes. Optionally, some level of filtering can be achieved simply 
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by matching the drag geometry to the geometry of gauges that should have, but did not, bind 
well. 

Geometrical and/or chemical aJSinity analysis may also be used to detemiine a shape of 
the natural substrate of the target, for example, if it is not clearly known and/or to determine 
5 which part of the substrate is engaged by area 104. 

7.7 IDENTIFICATION OF CONTROL AREAS 

In an exemplary embodiment of the invention, the binding results and/or reconstruction 
are analyzed to detect one or more control area of the target. Generally, control areas do not 
bind to the "main" substrate of the target, instead binding to a separate hormone or other 
10 modifier molecule. This secondary binding typically affects the binding behavior of the target 
area. 

In an exemplary embodiment of the invention, control areas are idratified by their size 
and by their being disjoint firom a main target area layout reconstraction. Alternatively or 
additionally, control areas are identified by testing bindings with pairs of gauges (or in the 

15 presence of various molecules, optionally selected a-priori or after the detection of the 
presence of control areas) to detect intra-gauge binding dependence. Altematively or 
additionally, control areas are identified fi-om the shape of the reconstructed layout. 
Altematively or additionally, the presence of control areas is detected by there being left-over 
gauge bindings that are not needed and/or do not fit in the reconstruction. 

20 hi an exemplary embodiment of the invention, depending on whether binding to a 

control area is desirable or undesirable, the differential identification of control areas may be 
used for screening potential drag leads. 

7.8 OTHER MAP ANALYSIS 

The map or model of the target may be analyzed to yield other information, in 
25 accordance with exemplary embodiments of the invention. For example, as noted above, the 
distance of a binding point fix>m a control area or active area can afTect the type of drag 
developed. For example, a drag that binds in the control area may have an enhancing effect on 
the target, for example that of an agonist. A molecule that binds near the control area or active 
area, or inside the active area, may cause the target to be less sensitive to signals and/or 
30 incapable of acting, e.g., an antagonistic effect. Thus, in an exemplary embodiment of the 
invention, the location of the binding area on the target is used to assist in determining what 
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sort of therapeutic effect to expect from a developed drug. For example, a binding area near a 
target area may indicate a drug whose tail blocks access to the target area. 

In another example, binding areas that are outside the target area, can be used to 
enhance a drug design. A drug may be constructed (or discovered) to include parts that bind in 
5 the target area and parts that bind outside the target area. The combination of binding areas 
provides a binding strength greater than that provided individually by each area, while the part 
of the molecule bound in the target area can provide the desired therq>eutic effect. 
Alternatively or additionally, a molecule that binds to two separate areas may cause a 
conformal change or prevent such a change in the target molecule. 
10 8. USE IN DRUG DISCOVERY PROCESSES 

8.1 OVERVIEW 

Drug discovery is a very long and expensive process whereby drugs for curing diseases 
are foxmd. The process starts with idmti^ng a target to be affected by the dmg, finding 
potential drugs that affect the target and then determining which, if any of the potential drugs 

15 is safe and dependable. Oftra, no suitable drug is found and one of the drug candidates is 
modified in various ways in an attempt to make it more suitable. One cause of difficulty of the 
dmg discovery process is the difficulty in knowing what molecule will affect the target. As 
will be described below, in some embodiments of the invention, the methods of the invention 
are used to at least partly reduce this diflScxilty. Another cause for difficulty is the many 

20 unexpected side effects of potential drugs which render them unsuitable and/or xmpredictable. 
Again, as described below, some methods of the invention may be used to at least partly 
reduce this difficulty. 

Typically, drag discovery methods try to answer two questions. One, is there/what is a 
drag molecule that binds strongly and affects a target molecule. Two, how to ensure that these 

25 drag molecules have the proper ADMET profile (ADMET stands for Absorption Distribution 
Metabolism Excretion Toxicity) which translates into success in clinical trials. In an exemplary 
embodiment of the invention, the method, materials and/or apparatus described herein are used 
to select, design and/or aim towards potential drag molecules that have chemical 
characteristics that are known or believed to improve the ADMET profile. Lipinski rales are an 

30 example. It should be appreciated that by knowing which parts of the molecule are relevant to 
binding and which are not (as provided, for example by comparing a molecule to a model of 
the target), one can more easily modify (or plan in advance) potential drag lead compounds to 
bind tightly and/or meet any well defined qualities. 
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In general, the above methods and especially the various models of the target can be 
useful in selecting molecules or research methods that conform to the map and rejecting those 
that do not. Other uses of the above measurement process are also described below, for 
example using an additional step of mapping to check a theory. As can be expected, different 
5 methods (existing and new) of drug discovery may be affected differently by the use of 
exemplary embodiments of the invention. 

It should be appreciated tiiat various embodiments of the invention may be 
implemented in an automatic manner. However, due to the great cost considerations, in some 
embodiments of the invention, the application is semi-automatic, for example, using the 

10 methods to change the discovery process, for example, by adding a filtering step or a candidate 
generation step, while still allowing for the use of human judgment, for example, deciding if 
certain generalized and vague criteria are met. In some case, the entire process is controlled 
using a human intelligence, with various ones of the steps, for example, mapping and/or 
rejecting leads are performed manually. Changing thresholds and redoing a step is an example 

15 of a decision which may be reserved for a human, for example. 

8.2 DRUG GENERATION 

One relatively new type of drug discovery is actually drug generation, a new molecxile 
is designed to have a desired function. In an exemplary embodiment of the invention, the 
above chemical and/or geometrical map of the target are used to assist this process. For 

20 example synthesis may be assisted by showing what shape an active part of the dmg must have 
(or limit the range of possible shapes). 

In an exemplary embodiment of the invention, drug synthesis comprises taking gauges 
firom the library and modifying them, for example replacing moieties, so that they better match 
the target. In some cases, the replacement moieties have the same affinity but a different 

25 binding strength, for example, selecting NH2 or OH for a Hydrogen bond donor, and choosing 
an optimal size for a for hydrophobic moiety. It should be appreciated that an strength based 
classification of moieties may also used in the library construction, for example, providing 
multiple strengths of Hydrogen donors or multiple sizes or hydrophobic moieties. One possible 
use is to achieve a better uniformity of binding strength between moieties. Another is to 

30 provide a library with higher accuracy. 

In an exemplary embodiment of the invention, scaffolds and/or molecule parts for use 
in drug synthesis are constmcted as a result of target mapping processes. For example, by 
analyzing target geometries, a set of scaffolds that spans (with attached moieties) most of the 



wo 2004/010136 




PCT/IL2002/000614 



target spaces may be found. The specific scaffolds may be, for example, constructed firom sub- 
scaffolds or be selected from libraries of chemicals, for example using a process similar to that 
described below for finding gauges in libraries. Sets of moieties or moiety clusters may be 
selected, for example based on a statistical analysis of how moieties are clustered together in a 
5 family of targets or in a general list of targets. Optionally, the statistics are collected over the 
mapping of many targets. Optionally, the targets are selected to be exemplary for expected 
future targets. It is expected of course, that in some cases a perfect dmg will not be generated 
using such synthesis methods, but the synthesized dmg may be a good starting point for drug 
enhancement. 

10 8.3 LEAD GENERATION 

Often simpler than drug synthesis is lead generation, in which a lead, which is not 

expected to be a suitable drug, is generated and then enhanced and modified using processes 
known in the art In an exemplary embodiment of the invention, the map is used to describe a 
potential molecule, for synthesis as a drug lead. In an exemplary embodiment of the invention, 

15 the m£^ is used as a set of constraints and a search is made to find a molecule meeting the 
constraints. Additional constraints can be, for example, known synthesis methods, a base 
molecule form being used as a starting point. An exemplary software which may be used is 
LUDI sold by MSI (USA). The LUDI system operates by attaching basic chemical components 
together in order to obtain a required pharmacophore-match or other molecule. 

20 The potential molecule may then be synthesized and developed into a dmg, as is well 

known in the art. 

In an alternative method, a potential dmg molecule may be constmcted by linking 
together molecules of the gauge library or other molecules having suitable moieties or 
stmcture, so that the resulting molecule has a higher affinity than a single gauge. This 

25 molecule may then be optimized, for example by removing unnecessary moieties and/or 
adding moieties to provide various desired properties. Optionally, the gauges are attached 
using a scaffold, rather than directly to each other. Optionally, by analyzing what gauges link 
(e.g., using clustering), it is possible to achieve a better estimate of a desired size and/or 
chemical properties of the fragments to be linked. For example, the selection of two gauges to 

30 be linked together may be based on actual binding of additional (or other gauges), for example, 
2, 4, 5,. 6 or more gauges. For each such set of gauges that bind, a best gauge or otitier molecule 
is selected for the linking. Alternatively or additionaUy, higher specificity gauges are used to 
determine which of the several possible triangle measures of a gauge actually boimd. Such 
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higher specificity gauges may be generated, for example, by removing moieties fi:om existing 
gauges (or generating such gauges using any method known in the art). Such higher specificity 
gauges may also be used for other embodiments of the invention, for example, to improve 
clustering statistics. Generally, however, due to the relative large niunber of possible such 
5 gauges, they are used when there is a way to limit the range of possible triangles. Altematively, 
the large niunber of more specific gauges, for example, gauges with I, 2, 3 or 4 triangles are 
created for use as a library or as part of a library of gauges. 

In an exemplary embodiment of the invention, knowledge of the structure of the target 
is used to correctly locate the linker and/or chose a suitable linker that does not suffer steric 

1 0 clashes with the target 

In an exemplary embodiment of the invention, the gauges are selected for linking 
without actually constructing a model. Instead, the actually binding gauges are selected and 
linked together. Altematively, the model is used to decide which gauges to link and how to 
link them. Such a model may also be used in other lead-linking schemes, for example, as 

15 described in the background, to guide the choice of which firagments to link, what length of 
link to provide, where to attach and/or at what orientation to attach. Optionally, the lead is 
constmcted in steps fix>m the gauges, and each step is tested to see if it meets its expected 
behavior. 

Altematively or additionally, instead of using a model as provided herein, a difTerent 
20 type of model is used, for example a model of the target bound to a gauge, for example 
generated using X-ray Crystallography and/or NMR. This model, for example, generated once 
for each of a plurality of gauges may be used to decide what linking distance and type to 
provide when creating a lead firom gauges. Altematively or additionally, a new molecule may 
be designed and constmcted to have binding points at some or all of the locations shown by the 
25 crystallography model to bind, for example, for two, three or more gauges. In general, this type 
of method assumes that once the configuration of the bound target-gauge pair is known, an 
exact model may be unnecessary, since actual conformance information is available. 
Altematively or additional, as noted herein, a measurement may be made of the bond target. 

8.4 LEAD DESCRiPTION 

30 In an exemplary embodiment of the invention, the m^ is used to describe one or more 

profiles of molecules which would be expected to have an effect on the target. In an exemplary 
embodiment of the invention, the profiles generated take into account one or more of: 
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(a) geometry of interaction location layout; 

(b) afiSnity of interaction locations; 

(c) size of entrance into the active area(s); 

(d) identification of potential control area(s); 
5 (e) synthesizability; and 

(f) expandability, e.g., that additional moieties can be attached. 

Typically, a molecule requires at least five or six bonds to form a strong enough 
docking in the target, so as to affect the a target at nano-Molar concentrations. The exact 
number may depend, for example, on the affinity of the interaction locations. A single target 
10 will generally provide a large number of possible profiles. These profiles may be matched 
against libraries, for example, using methods known in the art. 

In an exemplary embodiment of the invention, the profiles are generated using a format 
that is matched for a particular search software and/or library data structure. In an exemplary 
embodiment of the invention, searching by pharmacophore is provided, for example as known 
15 in ISIS base firom MDL (when searching 3D databases). 

8.5 LEAD SEARCH 

In an exemplary embodiment of the invention, the map is used to search through a 
library of known molecules, for a possible match. Possibly, the map is used in place of 
analytical models of the target, in known virtual scanning techniques. In an exemplary 

20 embodiment of the invention, the library is pre-processed so that molecules in the Ubrary are 
described in terms of the moieties and geometries of the layout model and/or the gauges used 
in measuring the target. Alternatively or additionally, an existing library is pre-processed to 
yield a gauge-compatible description of its contents, for example, each molecule being defined 
as a parametric model based on measurement gauges. It should be noted that this description 

25 may not be a one-to-one mapping, for example, a same molecule may be described using two 
different sets of moieties, as there is some overlap between moieties chemical behavior. 

In an exemplary embodiment of the invention, potential leads are identified based on 
them includmg or being able to include a large number of moieties at the required positions, as 
indicated by the map. In one example, a search is made for 3 point or higher (e.g., 4, 5, 6, 7 or 

30 more) matches. In another example, each molecule in a library is tested for the number of 
moieties it includes in the required positions and for the availability of attachment points to 
attach missing moieties. In an exemplary embodiment of the inverition, the missing moieties 
are added one by one until a suitable drug lead (e.g., strong enough binding) is created. 
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An exemplary search is performed by ISIS base, by MDL. 

One possible type of search comprises going over all available 3D structures in which 
to search, breaking them down into sets and/or subsets of pharmacophore points and looking 
for a fit within a tolerance range defined in a query. 

5 8.6 LEAD REJECTION 

In an exemplary embodiment of the invention, the results of the above methods are 

used in rejecting leads that otherwise appear suitable. In one example, a lead (or family of 
leads) is rejected if the above models imply a lack of binding and/or steric clashes. In another 
example, an assumption is made that if a lead is suitable, a gauge corresponding to a triangle 
10 (or other number) of moieties on the lead is expected to bind to the target. If no such gauge is 
found or an analysis of the data implies that the probability of a triangular binding of the three 
moieties in a gauge is unlikely to have happened, the lead is rejected, or subjected to additional 
scrutiny. Alternatively or additionally, a matching of a certain gauge may also indicate the lead 
is unsuitable. 

15 In one example, workers in the field can use the information provided to determine if a 

certain lead is likely to be capable of being optimized (reasonably). For example, one expects 
that by directly adding or removing specific moieties (e.g., what is often considered to be a 
main type of small changes in a molecule) affinity can be significantly improved (often at least 
3-4 orders of magnitude are required). By knowing what the changes should be or could be 

20 (e.g., where additional points need to be added, which information may be provided by some 
embodiments of the invention) one can see if one specific lead can undergo the required 
changes, e.g. has putative attachment points in the right positions. Specific gauges (e.g., that 
bound) will indicate what the required changes could be. 

These methods may also be used to reject certain lead modifications provided during 

25 lead enhancement processes. 

It should be noted that some lead rejection methods do not require all possible gauges 
and/or triangle measures to be available. Rather, even a partial library is useftil, for example 
for rejecting certain leads. In one example, a partially-spanning library is used generating a 
partial map (e.g., part of space, disjoint and/or not all binding points), which can be used to 

30 reject some leads and/or indicate potential suitability of others and for search. Further, even the 
binding or failure of binding of a single gauge may indicate suitability or lack of suitability of a 
lead. Generally, due to the imcertainty involved in all chemical processes at this time, 
decisions are not made on the basis of a single binding assay. 
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8.7 TARGETED MAPPING 

In some embodiments of the invention, gauge binding is assayed during the discovery 

process. In one example, the binding is used to test a theory or an assumption regarding the 
leads. For example, if a certain lead is expected to be suitable, at least one of several particular 
5 gauges may be expected to bind. Leads may be ranked, for example, based on how well such 
targeted binding is. Altematively or additionally, a part of the layout may be remapped as a 
result of the discovery process. For example, the discovery process may indicate conflicting 
evidence of the layout structure. In another example, a higher resolution mapping of part of flie 
layout may be required, for example, to more exactly detemiine the distance between two 

10 moieties. In some cases, instead of assaying with a fiill set of gauges, gauges are selected based 
on them being most likely to bind (or not) to tiie particular desired parts of the layout. For 
example, if the distance between two points on the layout needs to be determined, gauges that 
are less likely to bind at other points of the layout are selected. In another example, the 
moieties used are more specific, for example, having a more limited repertoire of chemical 

15 behavior and/or have a greater directionality. This may require using a different scaffold. 
Possibly the gauges used for such remapping have fewer triangles per gauges, for example 
between one and three, to reduce unexpected binding probabiUty. Altematively or additionally, 
gauges are selected so that steric clashes prevent binding in imdesired locations. In some cases, 
these gauges are not in the basic mapping library used for initially determining the layout. In 

20 some cases, the required gauges are synthesized ad hoc, rather than selected from an existing 
library, . 

as TARGET SUITABILITY TESTING 

In an exemplary embodiment of the invention, the map is used to determine the 

suitability of a target to be a target for a drug. A suitability value may be, for example binary or 
25 . it may be graded (discrete or continuous). In some embodiments of the invention, a suitability 
value is not a scalar, for example, being a vector, with each element of the vector indicating a 
different aspect of suitability. A similar stmcture may be used to indicate suitability of leads 
and potential dmgs. 

One example of the use of target suitability testing is where there are multiple potential 
30 targets. For example, in some diseases, there is a possibility of selecting between a plurality of 
target proteins, or selecting different parts in the chain of protein synthesis (e.g., DNA 
transcription, protein-chain creation, protein folding, protein post-processing and protein 
deployment). Some of these potential targets may be imsuitable. 
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In an exemplary embodiment of tiie invention, the map can be analyzed to detect such 
suitability, for example, by rejecting targets with an active area that is too large (for some types 
of treatment). The size of the target area can be detected from the layout geometry. 
Alternatively or additionally, a target may be deemed unsuitable due to its having a too 
5 generally active (non-specific) target area, which can be determined, for example, by analyzing 
the specificity of the determined target layout. Alternatively or additionally, a target may be 
deemed unsuitable, because its active area that has very weak affinities (e.g., a large drug 
molecule with many binding points may be required). Alternatively or additionally, a target 
may be deemed imsuitable due to its similarity to a housekeeping protein. This similarity may 

10 be detemiined by comparing the layouts of the target with those of known housekeeping 
proteins. Similarity to any human protein may assist in determining potential side effects ahead 
of time. In lead grading, a lead may be evaluated based on a probability of its interfering with a 
housekeeping protein, which is optionally determined by checking the binding of a lead to 
model layouts of housekeeping proteins. 

15 In an exemplary embodiment of the invention, a database of layouts of housekeeping 

proteins is provided, such a database may be provided using methods known in the art. 
Alternatively or additionally, at least part of the database is provided by systematically 
moping housekeeping proteins. Alternatively or additionally, at least part of the database is 
provided by generating **worst case" target area geometries or a range of possible geometries 

20 for the active areas, based a knowledge of the structure of substrates that are acted on by the 
protein. Such a worst case target area may also be used as prior information to assist in 
deciding which of several reconstructions is correct. 

8.9 TARGET PARTITIONING 

In an exemplary embodiment of the invention, the map is used to identify parts of the 

25 target as bemg potential "exact" targets, and on which the drug discovery method can be 
focused. Although the target, as a whole, is what is being affected by the drug, it can be 
affected in many ways, for example, different drugs may block different parts of an active area. 
Alternatively or additionally, some drugs may cause conformal changes. Alternatively or 
additionally, some dmgs may interact with control areas on the target. Alternatively or 

30 additionally, some drugs may be agonistic, while some antagonistic. Alternatively or 
additionally, some binding areas may be useful for staging (e.g., as a base for attaching 
molecules closer to a target area), rather than direct activity. Binding areas may be classified 
based on the type of effect that may be expected fix)m a molecule binding to those areas. This 
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classification may be, for example, manual. Alternatively or additionally, automatic 
classification may be provided, for example, based on target template structures (e.g., which 
indicated for a certain class of protein what each area of the protein might do). 

Optionally, potential control areas that can change the target, are identified. Possibly, 
such control areas are identified based on a binding in a binding assay. Optionally, a model of 
the target is used to assess whether binding at the potential control location can cause 
conformal changes, for example, based on the proximity of interaction locations on different, 
adjacent parts of the protein. 

In an exemplary embodiment of the invention, the active areas are segmented into 
different "exact targets" based on the sub-areas that have a potential for drug interaction, for 
example, based on their geometry. Alternatively or additionally, segmentation is based on 
selecting such sub-areas that are not common to similar sub-areas of housekeeping proteins 
(e.g., segmentation into special and common binding areas). 

8.10 DRUG AND LEAD ANALYSIS AND ENHANCEMENT 

Jn an exemplary embodiment of the invention, the above layout is used for analyzing 
existing drugs or drag leads, for example, to assist in improving or reengineering the drug or in 
screening. 

In an exemplary embodiment of the invention, the layout is used to determine which of 
a plurality of target areas on a target are interacting with the drug or which target fi^om a 
plurality of possible targets are mteracting with a given drug. This methodology may be used, 
for example, to analyze the effect of drags whose operation method is not clear. 

In another example, the drag is analyzed to determine which part of the drag binds to 
the target. This can serve as a basis of a process for modifying the drag, in which the binding 
parts of a drag are retained and other parts of the drag are modified. Alternatively or 
additionally, when modifying the drag, care is taken not to distort the active part of the drag so 
that it does not bind, or distort the drag as a whole so that steric clashes are caused. 

It should be noted that a single drag may interact with two different targets in a 
desirable manner, each target interacting with different, possibly overlapping, parts of the drag. 
Such activity of a drag is optionally determined by comparing the drag stracture to that of the 
targets. 

In some cases, the exact spatial and chemical properties of the drag (or a protein 
substrate) are not known. However, by determining the layout of targets which bind to the 
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drug, the spatial and chemical layout of the active part of the drug (or substrate) may be 
estimated. 

In another example, the layout is used to determine the pharmaceutical activity of 
synthesis byproducts. When a drug is produced using a particular process, various byproducts 
5 are produced as well, some with a beneficial activity and some with a non-beneficial activity. 
In an exemplary embodiment of the invention, the structure of such byproducts is compared to 
target areas of the target and of housekeeping proteins in an attempt to estimate what side 
effects they migjit cause. A process for drug manufacture is optionally selected or rejected 
based on a thus-estimated activity of the byproducts, given that the type and amount of 
10 byproducts produced by a particular process can be determined. Alternatively or additionally, 
such comparison may be used to assist in improving a production method and/or in deciding 
which synthesis parameters to use. This testing may also be used for regulatory purposes, for 
example to approve or disapprove generic drugs. 

8.11 DRUG SELECTION 

15 In many cases, there may be multiple dmgs which can treat an illness. Knowledge of 

which target (and housekeeping proteins and/or other human proteins) is affected by a drug 
and how it interacts can be useful in selecting between alternative treatments, in preventing 
side effects, preventing or controlling drug-interactions and/or in selecting treatments for 
diseases that no exact drug has been selected for, for example exotic tropical diseases and 

20 some viral diseases. 

In an exemplary embodiment of the invention, the layout of a target is used to select 
which of a plurality of available drugs or dmg leads appears to be most suitable for interacting 
with a the target. Jn the case of drugs, this may allow selecting altemative treatment protocols. 
Also, in some cases, knowledge of the interaction method will assist in selecting those times 

25 and/or associated protocols and/or drug combinations at which the drug is most effective 
and/or has minimal side effects. 

Alternatively or additionally, drugs may be designed to interact with multiple targets. 
For example, a lead that interacts with multiple targets (e.g., of a same or different disease or 
syndrome) or target area portions may be awarded a higher scoring for further processing that 

30 other leads. 

A possibly related use is the finding of a new use for an old drug and/or assisting in 
determining how to modify an old drug for a new iise. For example, when searching for leads 
that match a template, a search may also be made through databases of dmgs, to see which 
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drug has a structure that is prophesied by the modeling process to provide good binding. 
Existing drugs, in general, have the other properties (ADMET). 

8.12 DRUG ENHANCEMENT 

As noted above, knowledge of the interaction method and/or problems of interaction 
5 with a target area, can assist in modifying a lead to become a drug. Alternatively or 
additionally, such knowledge may be put to use in enhancing an existing drug and/or 
modifying a drug to interact with a target related to an existing target. By comparing the 
layouts of the two targets, for example, possibly useful changes in a drug may be determined. 
Alternatively or additionally, the layout of the target area may be used to assess problems with 
10 the binding of the drug to the target (e.g., too strongly or too weakly) and/or deteraiine the 
effect of modification of the dmg on such binding behavior. In an exemplary embodiment of 
ttie invention, the potential drug when bound is assessed against the model, to detemiine if a 
moiety exists that can be theoretically added, which will bind to another point in a binding 
area. 

15 Altematively or additionally, drag enhancement comprises enhancing a drug to match 

more than one target, or a variety of target mutations, for example including one moiety to 
bind for one mutation and one moiety to bind for another mutation, for example, in HIV some 
proteins have two main varieties and countiess sub-varieties. This enhancement may interfere 
with other properties of the drag, but the tradeoff may be considered useful. 

20 Altematively or additionally, a drag may be designed to bind to a subset of binding 

points that is common to a plurality of targets or mutations, for example, models of the 
plurality of targets are analyzed to determine shared binding points. The various drag 
discovery methods are then optionally applied assuming that only these binding points exist. 
Real assaying of a potential drag may be carried out on the multiple targets to ensure that the 

25 various modifications of the drag did not make it fail to bind to one of the targets. 
Altematively or additionally, when a modification is made, it is determined whether the 
modified drag will bind to the common binding locations and/or have steric clashes. It should 
be noted that there might be other reasons to discover a drag that binds only to a subset of the 
possible binding points, for example, if a mutation is expected in one of the binding points 

30 and/or to allow the drag to work even if an interfering molecule is bound to one of the binding 
points. 
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8.13 DRUG FAILURE ANALYSIS AND REENGINEERING 

Often, a drug will come out to market and then fail. The method described h^in may 

be useful in determining a reason for the failure and then possibly assisting in rescuing the 
dmg. In an exemplaiy embodiment of the invention, the layout of the target of the drug and/or 
5 other proteins that the drug is believed to have interacted with (e.g., based on the type of side 
effects), are generated. The drug is then compared to the targets to determine failures in 
binding to the correct target and/or undesirable binding to non-targets. It should be appreciated 
that while such comparison may be theoretically possible using other means, it is believed that 
prior to the availabihty of target mapping, such large scale molding of active areas of targets 

1 0 was not practical, due to time and cost limitations. 

In an alternative embodiment of the invention, it is noted that a dmg may be suitable 
for only part of the pubUc, for example, due to individual differences. In an exemplary 
embodiment of the invention, the genes that express inadvertent targets and/or targets are used 
to reconstruct models or samples of the targets and then map the active areas of the models. 

15 The results may show that an individual has a sensitivity to the drag and/or that a different 
individual is resistant to the effects of the drug. Altematively or additionally, the testing may 
be done against pathogen strains, to determine differential sensitivity to drags. In some cases, 
the genetic differences are linked to already known markers, for example sensitivity to sulfates 
is linked to a G6PD deficiency for sulfates, so that the classification of people as being 

20 compatible with the drag may be simple. Alternatively, a genetic test may be appUed prior to 
selecting which drag to use on a person. 

8.14 ADDITIONAL DRUG DISCOVERY RELATED ANALYSIS 

Additional analysis methods may also enhance a drag discovery process. For example, 

many drags have side effects due to their interaction with housekeeping proteins or proteins 
25 that cause feeling of malaise if interfered with. Examples include GI proteins and Uver 

proteins. Some drag targets are known to be similar to such proteins. In an exemplary 

embodiment of the invention, models are generated for such potential side-effect generators. 

Any potential drag lead is rejected (or scores lower) if it is shown to bind to one of these 

prohibited models. Altematively or additionally, drags that have a known side effect are 
30 analyzed to determine which protein they bind to and this protein and/or the particular binding 

locations are used for defining a prohibition of binding of a potential drag. 

In another example of an analysis, potential drag molecules are analyzed to see if they 

bind as a substrate to certain enzymes. Such binding may indicate a speed of incapacitation of 
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a drug or its excretion. Alternatively or additionally, such binding may be useful for identifying 
pro-drugs, that are activated by their interaction with certain enzymes, such as liver enzymes. 
In this case, a drug may include two sets of active areas, one for activation of the drug and one 
for binding of the drug to its target. Optionally, biding to a protease (or other manipulating 
5 protein) is ensured by adding binding moieties or gauges to a drug molecule at suitable 
locations. 

In another example, a set of target molecules that are all known to be affected by a 
same protein or molecule are analyzed to determine of they have common binding geometries 
to which the molecule bonds. This may help, for example, in fine tuning the molecule to bind 
10 more selectively, for example, by adding a moiety which will interfere with other target 
molecules and/or assist in binding to a particular target molecule. 

8.15 STREAMLINE DISCOVERY PROCESS 

As can be appreciated a discovery process typically includes going througih various 

dead ends. In an exemplary embodiment of the invention, mapping of the targets is used to 
15 select parts of the discovery process that are likely to fail and prevent them &om being 
attempted. Some examples (some of which are described elsewhere in this application) 
include, dropping targets that do not seem suitable for improvement, identifying targets likely 
to have side effects and weeding out libraries. In an exemplary embodiment of the invention, 
weeding out existing libraries is performed by removing from a library leads that have an 
20 expected low probability of binding and/or appear redundant to other molecules. For example, 
a molecule that is very flexible is less Ukely to bind. The probability of binding may be 
estimated, for example using energetic considerations based on the molecule's degrees of 
freedom. 

8.16 UTILITY GENERATION 

25 While many proteins and molecules are catalogued, many of them do not have a known 

utility. Determining an exact utility for a protein or a molecule may require a very large 
expenditure. In an exemplary embodiment of the invention, potential utilities for molecules 
and for proteins may be generated on a large scale in the following manner. A molecule may 
have a utility as a gauge or it may have a utility as a lead or dmg. In an exemplary embodiment 

30 of the invention, existing target area layouts, for example, 10, 50, 100, 1000 or any smaller, 
greater or interaiediate nxmiber are matched to the molecule to see if binding is likely. It is 
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expected that many molecules will turn out to have a potential utility. In general, more 
matching is more woik, but increases probability of success. 

In a similar manner, mapping proteins provides an indication of its active area shape, 
potential substrates and/or potential drugs which might affect it. In an exemplary embodiment 
5 of the invention, a utility is found for a protein by determining its substrate. Optionally, the 
protein active area layout is compared to structures of known substrates and proteins. 

In this manner, a library and individual dmgs and proteins may be said to have an 
expected utility. For example, the protein may be for one of the following protein families 
GPCR's, Proteases, Kinases, Ion Chaimels messenger proteins or any type of peptide or other 
1 0 macro-molecule found in a living organism. 

9. EXEMPLARY DISCOVERY APPLICATIONS 

9.1 OVERVIEW 

In this section existing discovery methods will be described, as well as possible 
modifications that take into accoimt the methods described herein. 
15 While many approaches to drug discovery are known, the following two main 

approaches generally encompass the existing methods. 

9.2 SCREENING BASED DRUG DESIGN 

This discovery method works by screening a target against a large number of molecules 

and then attempting to enhance any matches to produce a dmg. The process is as follows: 
20 (a) Provide a general hbraiy of compounds for screening, equally relevant to all target 
proteins. Typical sizes of such libraries grow constantly at roughly one order-of-magnitude 
(factor of 10) per decade. Current typical sizes are 1-10 million. The libraries are often 
proprietary and maintained by each corporation independently. 

(b) Screen the corporate library against the chosen target. Look for compoimds exhibiting 
25 at least weak activity (significant activity at concentrations typically 1-100 yM) of the type 

required with relation to the target. 

(c) If no hit is foimd, the process ends here. Apparently, this is often the case, possibly m 
above 70% of the cases. If hits are found, an optimization stage is initiated, in which the final 
outcome is expected to be a compound with strong activity (at concentrations typically nM) 

30 against the target. This is done in one or a combination of the following two methods: 

1, In case there is only one hit or all hits are variations of one molecular theme, a large 
number of analogues of the hit are synthesized. This group of compounds is sometimes known 
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as a "focused library*'. These are also screened against the target protein. The purpose here is 
to define a direction for increasing the activity of the original hit by identifying chemical 
moieties and positions on the original hit that increase activity. This process is known as 
developing a QSAR (Quantitative Structure Activity Relationship). 
5 2. If a number of chemical groups have been identified as hits, a computational process of 
identi^dng possible pharmacophores (molecular substructures directly involved in binding of 
the hits to the target) is executed. These may indicate not only possible directions for 
optimization, but also their feasibility for a given molecular starting point (both firom a 
physical point of view and fiom a synthesis point of view). 

10 (d). Drug like qualities are generally a byproduct of this process. Molecules in the initial 
screening library are often chosen to possess dmg-like qualities. During the optimization 
process, only partial information is available so that simultaneously satisfying drug-like 
requirements and increased activity are seldom under direct control. Final drug-candidates that 
may result fi-om this process closely resemble hit compoxmds in the initial screening library. 

15 (e) Testing. The dmg-candidates are tested, for example in live animal models and then in 
humans, to determine there efficacy. Many drug candidates* fail at this point and lacking any 
basis for modification, fail completely. 

In an exemplary embodiment of the invention, the above described inventive methods 
may be used to improve the above drug discovery process, for example one or more of: 

20 (a) Hit rate. As stated above, in most cases, no hits are found for a new target. By 

generating a mq)ping of the target, the leads used for screening can be better selected. Even 
leads with very weak affinity may be selected for further improvement, due to the combined 
indication of very weak activity and matching a map. Alternatively or additionally, the method 
of designing a gauge library is applied to a molecule library, to reduce duplication and to assist 

25 in ensuring coverage of binding space. This may be done, for example, by analyzing the library 
to identify gauges in triangle space and/or uneven distribution leads in this space. In addition, 
excessive overlapping may be determined. Alternatively or additionally, the library may be 
analyzed to determine molecules that are unlikely to ever bind, for example, due to them 
having excess flexibility and no known binding partners. Alternatively or additionally, if the 

30 screening is in stages, molecules may be selected for each stage based on them having less 
overlap with each other. 
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Alternatively or additionally, some binding results may be ignored, for example, 
molecule with high flexibility may add too much noise (binding to many molecules in many 
ways) and therefore be ignored, at least in a first stage of processing. 

Altematively or additionally, the gauges that bind can themselves be used as leads (and 
5 many of such bindings are expected). Often the gauge library is small compared to the 
corporate library and can be added to it with a relatively small penalty. In an exemplary 
embodiment of the invention, results from the **old" library will serve as initial starting points 
for optimization (as before) but optimization will be directed by information gained fiiom 
screening using gauges. Possibly, a gauge library binding assay is performed on a target with 

10 an interacting lead. This type of assay can be used to detemiine if the lead (or molecule from a 
Ubrary) is interacting with the active area or not (e.g., based on whether and the extent that it 
affects the binding of the gauge library). This assay may be compared to an assay performed 
with other binding leads and/or with no bound leads at all The effect of lead chemistry may be 
determined by checking the assay in the presence of one or more chemically similar but non- 

1 5 interacting leads. 

(b) Process directing. If the target is mapped and a lead starting point is known, 
there are still many ways of enhancing the lead to produce a drug. In an exemplary 
embodiment of the invention, knowledge of the target geometry and/or chemical behavior is 
used to assist in directing the modification process, replacing physical experiments with virtual 

20 ones and/or assisting in culling out (probably) useless leads. In addition, it is noted fliat various 
combinatorial generation of lead modifications can be simplified by selecting only those lead 
modifications that are meaningful (or are most meaningfiil) in view of the target layout and/or 
based on the three-dimensional structure of the leads (e.g., by checking which triangles are 
exhibited by which lead and by which lead modification). Optionally, a mismatch between the 

25 results predicted by the determined layout and actual binding activity of the leads may be 
useful in correcting the layout, better understanding the chemistry of the lead and/or predicting 
other leads that might show promise. 

(c) Dmg recovery. Even if a drug fails the final testing stages, in an exemplary 
^bodiment of the invention, the above methods may be used to determine the reasons for the 

30 failure and/or provide guidance in reengineering the dmg. 

9.3 ALTERNATIVE SCREENING BASED DRUG DESIGN 

Chemical genomics or chemogenomics have lately become very popular. They are 
based on the idea that instead of first finding a target first and then finding a compound for it, 
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the opposite process is applied: first screen compounds against whole cell assays looking for 
the phenotypic result (e.g., selective death of cancer cells). Then, once an active compound is 
found, the target is sought. One possible advantage of this approach is working in parallel on 
multiple targets, many of which may not even be known. However, existing screening libraries 
5 cannot guarantee finding hits. In an exemplary embodiment of the invention, a gauge library as 
described herein is used and is expected to have a plurality of gauges that interact with the 
cells. While the interactions may be weak, a non-trivial number of such interactions may be 
expected. 

9.4 STRUCTURE-BASED DRUG DESIGN 

10 This method assumes that accurate modeling software for simulating molecular 

processes is used. The process is as follows: 

(a) Obtain an accurate and detailed three-dimensional structure of the target protein. 
Usually done via X-ray crystallography or NMR analysis (both experimental). Computational 
^proaches also exist, but are generally not accurate. 
15 (b) Identify the active site in the protein structure (not always straightforward for new, 
unfamiUar targets). 

(c) Identify relevant binding points in the active site, also known as pharmacophore points. 
These are points where weak (non-covalent) binding can occur. A potential Ligand must 
satisfy a number (usually 6 or more) of these points simultaneously in order to achieve nM 

20 af^nity. 

(d) Design molecules that "fit" the active site, both geometrically and in terms of satisfying 
enough pharmacophore points. Both this stage and the previous are done using "docking" or 
molecular-mechanics type simulation software. 

In an exemplary embodiment of the invention, the herein described inventive methods 
25 may be used to improve the above drag discovery process, for example one or more of: 

(a) Linked structure. 3D structures of proteins are apparently, in many cases, of 
little use in and of themselves. Much experience has shown that it is difiScult to design strong 
binders based on this (e.g., geometrical) information alone. In an exemplary embodiment of 
the invention, it is noted that useful information is present in 3D structures of the target with 
30 bound ligands. While such ligands are not known initially, in an exemplary embodiment of the 
invention, gauges that bind to die target are used in place of such ligands, with the expectation 
that a significant mmiber of such binding gauges will be found. In an exemplary embodiment 
of the invention, the gauge binding process is applied and then the target is modeled (e.g., 
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using NMR or X-ray crystallogr^hy), possibly several times, with different gauges linked. 
The sh^e of the target area with the linked gauges is expected to be usefiil for designing 
strong binders using methods known in the art. Possibly, the known methods may be modified, 
for example, to combine the results of different configurations caused by different binding 
5 locations of different gauges. Optionally, the provision of multiple binding gauges (e.g., 5, 10, 
25, 50, 100 or any smaller, intermediate or larger number) will assist in determining the 
binding mode(s) of the target, possibly enhancing the understanding by providing partial 
binding modes as well. In general, the provision of more gauges, means more woric, but may 
enhance the accuracy of Hie analysis. 
10 In an exemplary embodiment of the invention, the linked stmcture results from a 

plurality of gauges are combined, for example by super position with the target as a reference. 
This superposition may yield a total model of the binding area of a target and/or fully bound 
configuration, rather than a partial one might be provided by each gauge. 

(b) Comparison. In an exemplary embodiment of the invention, the shape of the 
15 active area determined by the simulation model is compared to the shape of the area as 

determined by the mapping process. Differences between the two may assist in correcting the 
mapping/reconstmction method or it correcting the simulation model. Optionally, the 
simulation model is used to select between alternative reconstruction and/or to assist in fine- 
tuning a reconstruction, for example, by assisting in calculating more exact distances and/or 
20 indicating which possible moieties could be taking part in the binding. 

(c) Identification of binding points. In general, modeling software is not accxirate 
enough to predict binding points in a protein target. Also active areas may be difficult to 
identify. This is especially the case for novel targets. In an exemplary embodiment of the 
invention, the above methods circimivent one or both of these problems by identifying 

25 potential binding points/modes experimentally, e.g., using a standard assay library of gauges. 
Then these active areas are analyzed in greater depth using docking software, for example to 
predict the affinity of new compoimds to a specific target. 

9.5 MODULAR ASSEMBLY OF LIGANDS 

This method, which is apparently used by Simesis inc., works by constructing leads 
30 from parts that show affinity. The process is as follows: 

(a) Synthesize a finite library of elementary molecular firagments that include a "linker 
port" (i.e. a site on the molecule at which linking can be easily implemented). These are 
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typically small molecules previously identified as phannacologically "interesting", and which 
are amenable to including the standard "linker port*'. 

(b) Screen the elementary fragments against the target protein, looking for extremely 

(--ImM) low affinity. This step is typically problematic. 
5 (c) Link groups of two or more fi-agments via their "linker port** components in order to 

achieve increased affinity. The distance between two firagments, i.e. the length of the linking 

chain» may be varied and optimized. 

In an exemplary embodiment of the invention, the herein described inventive methods 

may be used to improve the above dmg discovery process, for example one or more of: 
10 (a) The elementary fi:agments are currently not designed in the art using any logic 

that may be viewed as exhaustive, i.e. typical diversity metrics are used (as in standard 

screening libraries) but these do not yield a finite list. Consequently, hits are seldom found (for 

general targets), even less than for general screening libraries, probably due to very low affinity 

expected, which poses many technical problems (e.g. solubility). In an exemplary embodiment 
15 of the invention, the set of firagments is selected based on spanning the space. For example, 

fi-agments may be pairs (or triplets) of moieties, having distances and moiety types selected to 

span the possibility space. 

(b) Geometry, i.e. the proper distance and orientation between two weakly binding 

moieties, is totally absent from the initial screening results in the art. In the linking stage, only 
20 very limited geometry variation may be tried (i.e. the length of the linker). In an exemplary 

embodiment of the invention, the binding of a gauge library is used to provide geometrical 

hints (or a complete model) which assist in deciding how to put together fragments, which 

fragments to put together and what distances to set between the fragments. This may also assist 

in determining what type of linker to use when linking fragments. This may also be used for 
25 synthesizing a new molecule that includes the binding parts of the binding gauges, spaced 

j^art by a suitable structure (e.g., a variation on a known drug), 

10. EXEMPLARY NON-DISCOVERY USES 

The above measurement methods may also be applied to uses other than drug 

discovery. A different gauge set may be required for some uses. 
30 In one exemplary embodiment of the invention, the measurement methods are used to 

assess toxicity, for example, to identify housekeeping proteins that may have adverse 

interactions with a certain drug or potential toxin. This may be usefiil in determining toxicity 

of industrial or household chemicals. 
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In another exemplary embodiment of the invention, the measurement methods are used 
to predict antibody afSnity to a material and/or cell, for example by identifying binding sites 
on an antibody and/or a material. 

In another exemplary embodiment of the invention, the measurement methods are used 
5 to map the outside of an organism, for example, a virus, rickettsia bodies, worm, protozoa, 
fungus, ameba or a bacteria. This may be useful in the development of vaccines. For example, 
a vaccine is often more effective if it is made from a protein whose shape does not change. By 
determining which parts of the binding areas on the outside of a pathogen do not change, such 
determination may assist in selecting a particular protein from the pathogen for vaccination use 
10 and/or to assist in assessing the chances of creating a useful vaccine. In order to prevent auto- 
immune responses, the active areas of existing vaccine material may be mapped, to see if the 
pattem resembles that of bodily proteins to too great an extent. It should be noted that this 
matching may be dependent on an individual's genetic material. 

Alternatively to absolute measurements, in some embodiments of the invention, the 
15 above methods are used for determining relative measurements, for example, for measuring 
conformal changes in a protein, imder different conditions. A same (or different - e.g., to 
match new expected measurements) binding assay may be applied to the protein under 
different conditions. Possibly, more flexible gauges and/or less stable gauges are -used for this 
application. 

20 In anotho: exemplary embodiment of the invention, the above measurement method is 

used to find new agricultural chemicals, such as insecticides and herbicides that are target- 
specific by affecting proteins known to be crucial only for some types of pests or weeds. 
Altematively or additionally, artificial hormones are developed to match targets in plant cells. 
11. USING PRIOR INFORMATION 

25 The above process has been described, in some examples, as a blind process, which 

assumes a neutral starting point of substantially no knowledge about the target. In some cases, 
there exists prior knowledge about the target, gleaned from various sources and/or by previous 
measurements of the target. Such prior information may be used in many ways. Following are 
some examples. 

30 In an exemplary embodiment of the invention, the prior information is sufficient to 

propose several altematives. A binding assay with the gauge library, with or without 
reconstruction may provide enough information for selecting between the altematives, for 
example between alternative models of which part of a lead interacts with a target or selecting 
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between two target area layout reconstructions. Optionally, to this end, the gauge set can be 
reduced to only those gauges that will distinguish and/or that are needed by either one of the 
models. 

In another example, crystallography, NMR, IR spectrum and/or chemical properties of 
5 the target are used in the above reconstruction process, for example, to resolve ambiguities 
and/or to overcome lack of data. In one example, these methods show how one or more gauges 
actually bind in the target. In another example, these methods or other prior knowledge are 
used to force a certain structure to be reconstructed, rather than following the above described 
score based reconstruction. For example, forcing the stmcture to include a certain sub-shs^e 

10 (e.g., a tetrahedral portion) that would not otherwise be reconstructed from the assay data. 

La another example, if part of the target is known, it can be reacted with a substrate that 
blocks out that known part, so that the measurement will only apply to the unknown portion. 
Alternatively, the statistics of interaction in the known portion may be used to assist in 
associating binding statistics with stmcture in the unknown portion. For example, a computer 

15 model or an analogue target may be used to provide an estimate of which gauges bind and at 
what strength, to the known portion. In the assay results analysis, gauges that bind to the 
known area are ignored, not used in the assay and/or their binding strengtti reduced during the 
analysis. Optionally, a gauge is not removed from consideration if removing it will leave no 
triangles of a certain size and/or moieties for binding to the unknown area. Alternatively, the 

20 library as a whole is used, for example, as noted above that simultaneous screening using 
100,000 assays at a time, is a current technology. 

La another example, when an iterative measurement method is used, prior information 
may provide insight into desirable starting points. 

Optionally, the prior information is used as an input for modifying the binding process, 

25 for example by varying the binding environment. 

In another example, the prior information is used to set the environmental conditions 
used during measurements, for example, using information from previous assay attempts with 
a similar protein to indicate what environmental conditions are likely to provide bindings 
and/or at least not interfere. 

30 La an exemplary embodiment of the invention, prior information is used for the design 

of specific scaffolds, moieties and/or gauges to better measure a particular target. The 
molecules may be, for example, designed ad hoc, and/or a sub-library constructed by selecting 
previously known molecules. In an exemplary embodiment of the invention, a scaffold is 
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selected for such a sub-library due to a small (e.g., 0.5 A) difference in a side of a triangle due 
to the change in scaffold. In a regular mapping process, such a difference may not be 
important, but in high-resolution mapping, for some targets (e.g., where binding is weak) it 
may be important. Similarly, a set of gauges may be provided to cover a certain range of sizes 
5 and/or chemical behaviors at a finer resolution. 
12. ITERATIVE MEASUREMENT 

In some ways similar to the use of prior information, iterative measurement allows 
information form a previous measurement step to be used, for example, to better tune a current 
step or to reject certain possibilities. 

10 Jn some embodiments of tiie invention, instead of a one step measurement process, for 

example as described in some of the embodiments above, an iterative measurement method is 
used. In one example of this method, a lower resolution reconstruction is generated. Then 
additional assaying is performed, using a same or different gauge library and a higher 
resolution reconstruction is provided. The earlier reconstruction may be used, for example, as a 

1 5 starting point for the reconstruction process and/or to assist in selecting which gauges to use in 
the additional assaying. In an exemplary embodiment of the invention, such an iterative 
method is used, for example, when the cost and/or time to perform a single complete assay are 
large. 

hi an exemplary embodiment of the invention, an iterative measurement uses more 
20 flexible gauges (explained below) in a first set of measurement than in a second set of 
measurements. Alternatively or additionally, a different subset of gauges is used for the 
different sets of measurement. 

The difference between the stages may be in correctness of the reconstruction, for 
example, which interaction locations lie where. Altematively or additionally, the difference 
25 may be in accuracy, for example, in the distance between two binding locations or the bond 
angle of an interaction location. In an exemplary embodiment of the invention, the above 
assumptions of range coverage, for example, for hydrophobic bond sizes and for directional 
bonds are made stricter in later reconstruction it^tions, for example, providing 15 directional 
bonds. However, not all the measurements may need to be redone. Instead, only those gauges 
30 that bond to interaction locations that are expected to change in the model, are used. Various 
search methods known in the art may be used to assist in providing and/or determining 
convergence of the assay and reconstruction process, for example, hill-climbing. 
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13.1 OVERVIEW 

Various uses of gauges are described above, some of which may use a complete gauge 
library (e.g., completely spamiing and having sufBcient resolution) and some which may, 
alternatively or additionally, use a partial library. One or more of several issues are optionally 
considered in the design of such libraries. Exemplary such issues and considerations that may 
optionally be used when designing and/or selecting gauges, gauge designs and/or gauges sets 
are described below. It is noted that some of the issues relate to the properties of the individual 
gauges and some to the properties of the gauges as a set. The design (and/or selection) of a 
complete set of gauges may address multiple issues and various tradeoffs, for example as 
shown in the exemplary gauge set described below. These issues are explored below. In 
general, it should be noted that even some of the gauges in a gauge set are not useful, this does 
not generally detract from the usefubiess of the gauge set as a whole. 

Fig. 4A showed an exemplaiy gauge 400. A typical gauge set includes a large plurality 
of gauges. Possibly, all the gauges share a basic common design, as will be described below, 
however this is not essential. In addition, there can be many gauges, gauge designs and gauge 
sets that are useful for measurement. 

In an exemplary embodiment of the invention, a significant portion of a gauge set is 
based on permutations of a small number of basic molecules, called scaffolds. In this design 
method, a scaffold includes a plurality of attachment points and each gauge is created by 
selecting a scaffold and mounting various moieties at the attachment points. One potential 
benefit of this approach is that fewer different chemical processes are required for synthesizing 
a library. Another potential benefit is that the generated library has more predictable chemical 
behavior, reflected, for example in the environments used for assaying. Another potential 
benefit is that a more predictable and/or controlled set of distances between moieties may be 
achieved. Another potential benefit is simplicity is designing a spanning library. Another 
potential benefit is that it is easier to ensure spanning in a library or library portion. Another 
potential benefit is using this type of permutations (possibly with scaffolds novel to the library) 
supports generation of missing or desired measures, ad-hoc. In one case, for example, new 
gauges with particular distances are generated by modifying an existing scaffold. It should be 
noted that not all these potential advantages are expected in every embodiment of the 
invention. 
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It should be s^preciated that for a given library, parts may be based on scaffolds, while 
other parts are generated using oflier means, for example, selection form an existing molecular 
library and/or constructed using various molecular construction, design and synthesis methods 
known in the art for attempting to custom create molecules with certain properties. Further, the 
5 entire Ubracy can be non-scaffold based. It should also be appreciated that not all scaffold- 
based libraries provide all, some or even any of the above potential benefits. 

13.2 SCAFFOLD 

In Fig. 4A, gauge 400 is shown to include a scaffold 402, to which four moieties are 
attached, at four of possibly more potential attachment points. In an exemplary embodiment of 

10 the invention, gauges 400 are selected to span a range of distances between moieties. In an 
exemplary embodiment of the invention, by varying the locations of coxmection of moieties 
among available attachment points, different inter-moiety distances are fixed for a single 
scaffold. A greater range of possible values is optionally achieved by providing a range of 
possible scaffolds. It should be noted however, that no scaffold is required, per se. Rather, it is 

15 expected that at least for some embodiments of the invention, it may be more cost effective to 
create a library combinatoricly using scaffolds. This is exemplified in Fig. 4B, where the gauge 
is shown as a triangle defined by its moieties and the distance between them, without any 
reference to the sceiffolding. 

However, in an exemplary embodiment of the invention, a scaffold is provided on 

20 which multiple different gauges are constmcted. A plurality of different or same moieties may 
be selectively attached to different locations on the scaffold, using relatively standardized 
methods of combinatorial-chemistry, thus creating a range of gauges, possibly having generally 
known chemical properties (e.g., solvency, v^or pressure, stability). 

In some embodiments of the invention, the scaffold(s) is selected so that it does not 

25 extend to or out of the triangle shape(s) defined by the moieties. Alternatively or in some 
cases, the scaffold and/or some of ttie moieties do interfere with the binding, and may cause 
steric clashes. By providing a range of scaffolds, steric clashes may be avoided for some 
gauges and/or the causes of the steric clashes may be determined. 

In some embodiments of the invention, the scaffold geometry and/or chemistry is 

30 meaningful. 

Optionally, the participation of the scaffold in the provision of binding triangles is 
ignored in the design of the gauge set. Alternatively, the scaffold chemical activity is noted 
dijring the design of the set, for example, for providing one or more moieties. Optionally, the 
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effect of the scaffold on providing binding, repelling and/or interfering bonds, is considered 
during reconstruction or analysis. Alternatively or additionally, the geometry of the scaffold is 
taken into account during analysis, e.g., to determine causes for steric clashes. 

Altematively or additionally, triangle binding analysis ignores any binding triangles 
5 that are probably not exposed to the target (e.g., based on gauge geometry). 

13.3 VOLUMETRIC GEOMETRY OF GAUGES 

Triangles, as a rule, define a plane, which may or may not be the plane of the scaffold 

(if any). In an exemplary embodiment of the invention, when gauges are selected for inclusion 
in a library they are selected so that their attached moieties lie in a plane or in some other 

10 desirable conformity. A planar arrangement has a potential advantage of preventing multi- 
stable (e.g., conformal changing) molecules from being included, which is not desirable in 
some embodiments of the invention, as they may confuse the analysis and/or reduce the 
binding probabilities. Possibly, a set of gauges is provided, to cover a range of possible non- 
planar orientations. In some embodiments this is more desirable than selecting a molecule that 

15 exhibits conformal changes. Molecules with conformal changes may be excluded using other 
methods as well, for example, by analyzing each potential gauge. Altematively or additionally, 
the gauges are selected so that the dimensions of the gauge or of particular triangles in it do not 
change, even if other parts of the gauge exhibit conformal changes. Optionally, a certain 
triangle in a gauge may be neutralized by making it energetically unlikely to bind, for example, 

20 by ensuring that that triangle exhibits conformal changes or adding flexibility to the bonds of 
one or more of its moieties. It should be noted however, that such exact modification of a 
gauge may not be possible, for example, due to the small size of a gauge or its possible effect 
on other parts of the gauges and/or other triangles. 

13.4 FLEXIBILITY 

25 The flexibility of a gauge can adversely affect one or both of the amount of information 

provided by the gauges matching and the affinity of the gauge to the target. While it is true that 
flexible molecules are more likely to find an arrangement of points to bind to, increased 
flexibility may, at least in some cases, reduce the overall probability of binding of a molecule, 
for entropic reasons. In addition, the binding of a flexible molecule provides less precise 

30 information than the binding of a rigid molecule. 

Thus, although a greater number of interaction location layouts can be matched using a 
flexible gauge, in an exemplary embodiment of the invention, at least some relatively rigid 
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gauges are selected for the gauge library, so that the measurements using these gauges are 
more precise. Optionally, substantially all gauges in a gauge set are substantially rigid. In an 
exemplary embodiment of the invention, the gauges are translationally rigid, in that the 
distance between moieties does not change much. Alternatively or additionally, the gauges are 
5 rotationally rigid, in that the relative orientation of the moieties does not change. Optionally, 
flexibility extends to chemical specificity of the moieties, for example, by selecting moieties 
that are either more or less specific. For example, one can chose moieties that have only one 
function (i.e., for hydrophobic chose tert-butil or a non-aromatic ring (e.g. cyclohexane) or for 
hydrogen bonds avoid using a hydroxyl (OH) (which is both a donor and accqptor), or vice 
10 versa. 

In an exemplary embodiment of the invention, however, a small degree of flexibility is 
provided, for example to ensure overlap between gauges. In one example, the degree of 
flexibility is sxifficient so that a pair of moieties in the target can be matched by multiple pairs 
of moieties in the gauges, with different distances between them. In an exemplary embodiment 

15 of the invention, the gauges are designed such that each distance between moieties in the target 
can be matched both by a gauge that has a slightly longer distance and by a gauge that has a 
slightly shorter distance. The degree of flexibility may be defined so that a relatively low 
amoimt of energy is required to bend or stretch the gauge so that it can match the moiety layout 
in the target. The relevant energy levels may dqpend, for example, on the assay sensitivity, on 

20 the gauge concentration and/or the assaying enviroimient. 

Optionally, at least a small number of the gauges are flexible, for example to 
compensate for gauges that are not available. For example, as noted herein, rotational 
flexibility may be allowed for hydrogen bond participants and/or aromatic rings. Alternatively 
or additionally, flexible gauges are used to assist in providing coarse level information which 

25 may be later fine-tuned using rigid gauges. Optionally, the reduced amount of information 
(e.g., by lack of binding and/or less precision) is compensated for by the redundancy of the 
gauges and triangle measures in the gauges. 

It should be noted that particular method of determining which triangle bound, 
described above, provides a significantly greater weight to rigid triangles. It should be noted 

30 that in a single gauge, triangles may have different rigidities. 

In an exemplary embodunent of the invention, the Catalyst software firom Accehys 
(formerly MSI) is used to assess the rigidity of a gauge. 
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Id an exemplary embodiment of the invention, at least 20%, 40%, 60%, 80% or any 
smaller, intermediate or larger percentage of the gauges are rigid. In general, if more rigid 
gauges are used, they are easier to analyze using tiie methods described herein. However, such 
gauges may not be available and/or it may be desirable for various reasons to use non-rigid 
5 molecules, for example, if such molecules are similar to drugs or have other properties 
believed to make them suitable for screening. 

In an exemplary embodiment of the invention, a substantially rigid molecule (or bond) 
is defined as a molecule which has a single entropic configuration and, in which, except for 
hydrogen atoms, no bond changes by more than 1 A using less than 20kCal/Mole. Alternative 

10 embodiments of the invention may allow less rigidity, for example allow greater movement, 
such as 0.8 A, 1.5 A, 2 A or any greater, smaller or intermediate value, at 10 kCal/Mole, IS 
kCal/Mole, 30 kCal/Mole, 40 kCal/Mole or any smaller, intermediate or greater application of 
energy. It should be appreciated that absolutely rigid molecules are generally not possible. 
Instead, the term "substantially rigid" is used in the claims. As the molecules become less 

15 rigid, they may bind with more difficulty and be less specific in the meaning of their binding. 
However, less rigid molecules may be easier to obtain and/or use to ensure coverage, for 
example. 

Typically, rigid molecules are those for which all single bonds are either part of a ring 
or attach "end'* atoms i.e. at one of their ends (e.g., single atoms or simple moieties such as 

20 NH2, for which rotation is uninteresting in some cases). Once the ring grows too much, for 
example beyond 5 or 6 atoms in some cases, the ring becomes flexible. Larger rings may also 
be rigid, for example, if there are never more than 2 adjacent single bonds whose atoms 
participate only in single bonds (i.e. if any of the atoms in the ring are themselves attached by a 
double bond to an atom that is not a member of the ring, this also may rigidify that segment of 

25 the ring). A single covalent bond is rotationally free, unless it is part of a ring. 

13.5 GAUGE LENGTHS 

In an exemplary embodiment of the invention, the gauge sides lengths (i.e., the 

distances between the center of mass of the moieties) are selected to cover a range of expected 

distances between interaction locations and/or dimensions of small molecule drugs. 

30 Alternatively, for example, for non-small molecule dmgs, a different range may be selected 

than for small molecule drugs. In an exemplary embodiment of the invention, the selected 

range is between 2 A and 12 A. la another example, the range is to under 10 A, or under 8 A. 

Alternatively or additionally, the range is fi-om above 3 A or above 4 A. In some cases, an 
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"outer length" or an "inner length" may be usefid, which are defined &om the outside or inside 
of the moieties taking part in a triangle. 

In an exemplary embodiment of the invention, the sampling is selected to uniformly 
sample an energy cost required for a molecule to accommodate the sampling resolution. For 
5 example, if a first triangle side is x A and a second triangle side is y A, the range of distances 
covered by the first side should require a same amoimt of energy to modify the molecule to fit 
the range, as the range of distances covered by the second side. Generally, this means that as 
the molecule is larger, the binding range, for a same amount of energy, increases. Optionally, 
the allowed amount of energy is a parameter of the assaying process, the target and/or the 

1 0 gauges used, for example, to allow a detectable binding by the gauges. 

Li an exemplary embodiment of the invention, the range is covered by intermediate 
sizes, so that at least one gauge will match each intra-moiety distance, for each pair of 
moieties. Alternatively or additionally, at least two gauges or gauge sides are similar in moiety 
geometry. Alternatively, only two gauge sides match. Different environments may dictate a 

15 different niunber of gauges, for example, some bonds may exhibit more flexibihty at one 
temperature, but not at another. 

The sampling of distances by the gauges may be even along the range or it may vary, 
for example being exponential and/or stepped, due to the effect of the changing scaffolds 
between triangles, to achieve different triangle side lengths. 

20 It should be noted that some sets of side lengths caimot be combined in a single 

triangle, due to the required relationship in a triangle, namely, that the sum of lengths of any 
two sides be greater than the length of the third side. 

13.6 ENVIRONMENTAL STABILITY 

In an exemplary embodiment of the invention, the gauges are applied to the target 

25 imder normal physiologic conditions, including controlled pH, temperature and ionic content. 
They may thus be selected to perform correctly only in the standard environment. 

However, in some embodiments, the testing range may not match the physiological 
conditions normally present. In a particular example, a desired property of a drag may be 
activity at hyperthermia temperatures or for patients with a fever and not at normal 
30 physiological temperatures. 

A special set of gauges may be used for non-physiological conditions, for example 
replacing some gauges with others. Alternatively or additionally, a relatively stable set of 
gauges may be provided, which exhibit a same behavior over a wide range of environments. 
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Alternatively or additionally, even if the gauge properties change, if the change is known and 
spanning is retained, the reconstruction method may be adjusted (e.g., the locations and/or 
amplitudes in triangle space) to account for environmental effects. 

Another possible environmental variable is the type of solvent used, as some gauges 
5 may not be very soluble in water, so assaying may use non-standard solvents. 

In another example, the target may exhibit conformal changes, which are desired to be 
measured, under small changes in the environment, such as the concentration of calcium ions. 
It may be desirable that the gauges do not exhibit the same sensitivity as the target protein to 
die changes. 

10 Alternatively or additionally, the gauges may be designed or selected to change in 

different environments, thus, for example, allowing a single gauge to make multiple 
measurements, each at different environments. 

13.7 UNIQUENESS OF GAUGES AND OVERLAP OF SIDES AND TRIANGLES 

As alluded to above, two different gauge-sides lengths may match a particular 
15 interaction location configuration, for example, by an interaction location being capable of 
binding to two different moieties and/or due to flexibility in the gauges (and/or the target), 
which caimot be completely eliminated. 

In an exemplary embodiment of the invention, the overlap between gauge 
measurements is controlled to be substantially constant over the gauge space. Altematively or 
20 additionally, the overlap is minimized. Altematively, at least a minimum amount of overlap is 
encouraged, for example to compensate for various eventualities where a gauges does not bind 
or an assay fails or to provide additional linking information. 

It should be noted that even if substantially rigid gauges are used, there is a level of 
tolerance inherent in the interaction, so that some freedom is always available, albeit, possibly 
25 at the expense of binding strength. 

If the degree of overlap is known, its effects can be compensated for in the above 
reconstmction method, for example during clustering. Altematively or additionally, if an 
expected degree of overlap does not exhibit expected effects, the measurement is suspect. 

In an exemplary embodiment of the invention, however, a large degree of overlap is 
30 provided, for example a factor of two, three or more repetition of triangles. Fractional overlap 
may be provided, for example, by using moieties that have non-orthogonal aflOnities (in the 
detectable range) and/or as a result of partial overlapping between triangles. Generally 
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however, an exactly same triangle will not be repeated, for example, due to differences 
between scaffoldings and/or effect of other moieties within a scaffolding. 

Thus, altematively or additionally, to accidental overlap, some or all triangles are 
repeated between gauges. In an exemplary embodiment of the invention, this rq)etition is used 
5 to compensate for the effect of steric clashes and/or other unexpected chemical behavior 
exhibited by some of the gauges. Altematively or additionally, the repetition is provided to 
assist in determining which triangle boimd, based on the binding of gauges. To tiiis effect, the 
gauges may be selected so that there is a lesser overlap between gauges with respect to the 
other triangles the two gauges include. It s^pears, however, that if the scaffolds are sufficiently 

10 different, the probability of most of the triangles in one scaffold overlapping with most of the 
triangles in another scaffold is small. This may assist in distributing the overlapping between 
different scaffolds and gauges. Altematively, similar scaffolds may be used, so that a greater 
degree of overlapping of triangles of same gauges may be provided. It should be noted that part 
of the overlap is provided by the fact that the gauges may have some degree of flexibility, so a 

15 same triangular array of binding points can be matched by triangles of different sizes. In one 
exemplary embodiment of the invention, the Ubrary is designed so each triangular array of 
points can be matched by at least one larger triangle and at least one smaller triangle. This 
overlap may be in addition or instead of repetitive type overlap where a substantially same 
triangle is provided at least twice. 

20 Optionally, the order of moieties in a particular scaffold is controlled to account for 

expected steric clashes, for example, to assure that at least some triangles will not have the 
same steric clash problems as other triangles. 

Altematively or additionally, a mixture of gauges, having same triangles, but different 
expected steric clashes may be mixed in a single assay, to help avoid the steric clash problem. 

25 In an exemplary embodiment of the invention, while triangle overlap in general and are 

not exactly the same, the gauge triangles of at least some of the library, for example, 20%, 40% 
60% or any smaller intermediate or larger percentage, are selected so that distribution of 
triangles in triangle space forms a relatively discrete grid, with clusters of triangles near grid 
points. Altematively, at least some of the library, for example, 20%, 40%, 60% or any smaller, 

30 intermediate or larger percentage, is selected so that the coverage of the triangle space is 
relatively uniform, with less clustering. As noted above, overlap may be useful to overcome 
various causes of non-binding. However, greater overlap may mean a larger library. 
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It should be noted that overlap degree need not be uniform. For example, certain 
triangle sizes maybe more prone to steric clashes (e.g., if they all use large scaflFolds), in which 
case a greater overlap may be provided. Optionally, the clustering methods take the degree of 
overlap into account, for example to determine a threshold for deciding if a triangle was 
5 boimd. 

13.8 GAUGE MASS AND SIZE 

In an exemplary embodiment of the invention, the gauges are selected to have a 

minimal mass. It is expected that as mass increases, a gauge is more energetic and less likely to 
bind. Alternatively or additionally, greater mass often means greater size and more chance for 
steric clashes. In an exemplary embodiment of the invention, the scaffolds are selected to have 
a ynass under 200, not including moieties. Possibly, the increases mass of benzene ring 
moieties is offset, at least in part by their enhanced affinity. Altematively or additionally, 
gauges are selected by size, for example to be no larger than 4 fusen rings in size (e.g., about 
10 A). Altematively or additionally, when selecting a molecule for inclusion as a gauge, the 
selection is failed if the molecule is too large or too massive. It should be noted that in some 
case, the size considerations are relative. For example, it is desirable in some embodiments of 
the invention that a triangle have sides on the order of a size of a scaffold. Small triangles on a 
large scaffold may be ignored when considering the triangles contributed by a particular gauge, 
and possibly forced to be provided by a smaller scaffold. 

It should be appreciated that these examples are not limiting and a gauge may be larger 
and/or have a greater mass or be limited to be smaller and/or have a smaller mass, depending 
on the application or implementation, for example. 
14. PARTICULAR AND GENERAL GAUGE SET DESIGN 

14.1 EXAMPLE SPANNING LIBRARY SIZE 

25 Under certain assumptions, the following is an estimation of the number of gauges and 

triangles in a complete spaiming Ubrary for small molecules on protein targets. 

Assuming the range of lengths to be covered is 9 A^, at steps of 1 A**, the number of 
possible triangles is 10*10*10/(2*3) (factor of 2 for triangle in equality and factor of 3 for 
rotational degeneracy. Assuming 10 moieties and moiety directions, gives about 166,000 
30 triangles. Assuming an overly factor of 3 and 5 triangles per gauge, gives about 100,000 
gauges. These numbers are of course only exemplary, but may serve to clarify the following 
description of library design. 
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It can be seen fhat the size of the library depends on tiie triangle space to be spanned, 
the degree of accuracy, complexity of gauges and the degree of overlap. Any of these may be 
varied in accordance with exemplary embodiments of the invention, for example, yielding 
libraries with between 10,000 or fewer gauges and 1,000,000 or more gauges. Exemplary 
5 intermediate library sizes include 30,000, 60,000, 80,000, 200,000 and 550,000 gauges. In 
addition a library may include non-gauge elements or may form part of a much larger 
screening library, for example as described above. In general, the more gauges in a library the 
more work it is to apply as a whole. However, greater accuracy, specificity and coverage may 
be available as the library size increases. 

10 An example of smaller gauge libraries, are those that have only 7 moieties, reduce the 

sampling distance to 8 and/or reduce the overly factor to 2. Smaller and larger libraries and/or 
other modifications of library parameters, can also be provided in some embodiments of the 
invention, as well as various partial libraries. 

In another example, all gauges are designed to include a single triangle (or a small 

15 number), in which case about 166,000 gauges are needed (if there is no overlap). In such a 
specific-gauge library, the initial clustering step is optionally omitted. However, it is noted that 
gauges will generally include, at least inadvertently, more than one measure, so that clustering 
may still be usefiil. In some cases, a moiety is provided on a gauge to prevent the scaffold part 
of the gauge fi-om participating as part of a measure and/or to reduce the number of diflferOTt 

20 triangles provided by a particular gauge. 

14,2 GAUGE SUBSET SELECTION 

A particular type of gauge library is a subset library, which may be smaller tiian a 
standard library (but it may be larger, for example, if it has a higher resolution of lengths 
and/or moiety types). 

25 In an exemplary embodiment of the invention, only a subset of all the gauges are used 

for a particular measurement. In some cases this is because of the use of an iterative approach, 
which does not use all the available gauges at every step. Altematively or additionally, it may 
be desired to reduce the nimiber of assays performed. Altematively or additionally, this may be 
the result of a large overlap between different gauges. In an exemplary embodiment of the 

30 invention, gauges are selected to better operate in an environment (e.g., temperature, pH, 
solvent used) and/or exhibit fewer adverse interactions with the target and/or the assay, for 
example, in a cellular assay. Altematively or additionally, this may be the resuU of a failure to 
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create a complete spanning library, for example as shown in the example above which may be 
nearly universally useful for all protein targets of small drugs. 

It should be noted that one potential advantage of rigid gauges is that the geometry of 
many rigid molecules is minimally affected by environmental changes, even if their chemical 
5 behavior is affected. This may allow the gauge set to be more universed. 

In an exemplary embodiment of the invention, gauges for the subset are selected based 
on the target type, for example, the expected range of distances between the interaction 
locations. 

Alternatively or additionally, the gauges are selected responsive to a measurement 
1 0 need. For example, if a certain interaction location has an unknown size but is known to have a 
weak afGnity, a denser sanq>ling of the moiety size range may be used for that interaction 
location (e.g., for gauges that are expected to bind to that location). 

Alternatively or additionally, the gauges are selected responsive to knowledge of the 
available dmg types, for example, the types of possible hydrogen bond directions in the drug. 
15 Altematively or additionally, the gauges are selected to better distinguish between two 
potential dmgs, by providing better resolution for the differences between the drugs. 

hi some embodiments of the invention, the gauges are selected so that an 
approximately correct model can be reconstructed, even for those parts of the target for which 
lower resolution gauges are used. Altematively, the gauges are selected to determine if a 
20 certain dmg should bind to the target, so only gauges required for measuring a smaller range of 
possible configurations are necessary. 

Optionally, the gauges are selected responsive to a desired type of bond matching, for 
example, if the target and/or potential drug is known to include sulfate bonds, gauges including 
sulfate moieties are used. 

25 In an exemplary embodiment of the invention, a method of selecting a gauge subset 

comprises: 

(a) determining a use of the gauge subset; 

(b) determining a rale or rales for selection of gauges to meet said use (e.g., sizes, 
moieties, densities, etc., e.g., as above); 

30 (c) selecting fi-om the library a plurality of gauges that meet said rale(s); and 

(d) optionally, determinmg if the resulting library is likely to provide the desired 
information for said use. For example, a simulation may be made to see if the assay results are 
likely to result m a reconstraction (e.g., based on assay binding rate, density of coverage, 
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properties or target and/or degree of overlap required to distinguish between triangles on a 
gauge). In another example, the information is partial information and a simulation is run to 
see if the information can be distinguished. 



14.3 GAUGE LIBRARY DESIGN 

The following table shows an exemplary set of scaffolds for a gauge library design: 



AutoNom Name: 
Thiophene 



AutoNom Name: 
IH-Pyrrole 



AuloNom Name: 
Puian 



"N' 

AutoNom Name: AutoNom Name: 
Benzene Pyridine 



O 0 05-" 



AutoNom Name: AutoNom Name: AutoNom Name: 



AutoNom Name: 



N 

AutoNom Name: 



PyrimUine 



Pyrazine 



6H-'niienop,3-b]pyTTOle 1.6-Dihydro-pyrrolo[2,3-b]pyrTOle IH-Indoie 



CO 



AutoNom Name: AutoMom Nam«: 

ThfenotZ^dJpyrimldine fi'f^^^SXT*' 




AutoNom Name: 

3.4-DihydrO'^enzD[e] 
[1 .4]diazepln*5one 



AutoNom Name: 



N 

AutoNom Name: 
QidnoUna 



a 



AutoNom Name: 
IsoqulnoIIne 



AutoNom Name: 
Quinoxallne 



co ob 



3,e-Dihydn>4H-pym>lo „ 
(2.3-ettl .4]diazepin^ne M ,4Jdiazepln-5<ne 



AutoNom Name: AutoNom Name: 

3,4-Oihydro-thienol2,3-el 3.6-0lhydro^H-pym)Io(3^-el 



(1 .4)diazepirv>5-one 




AutoNom Name: 

5H.11 H-OibenzoMIl .5] 
dIdzoclne-6, 1 2-dione 




AutoNom Name: 

1 .4-0[hydro-10H-1.4,10-triBza-benzo 
[a]cyclopenta[e]cycfooctene-5, 1 1 -di 
one 




AutoNom Name: 

4H.i0H.l-Thta-4.l0^laza-benzo(a]cy 
clopenta[e]cych)octene-5.1 1-dlone 




AutoNom Name: 

D}pym>to|1,2-c:2'.1*-e] 
lmidazol-5-one 



AutoNom Name: 

1 .4.7.9-Tetrahydro-1 ,4.6,0-tetraa2a 
•0lcyc>O{>entaIa.e}cyclooctene-5. 10- 
dione 




AutoNom Name: 

4 J.9-Trihydro-1 -mia-4,6,9-tTiaza- 
djcyc<openta(a.e]cyciooctene-5.l0-d 




AutoNom Name: 

2.4.9-Tfihydro-llambda*4*.6-dlthia. 
4.9-dlaza-dicyclopenta[a,e)cyclooct 
eno-5,l0^ione 
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AutoNom Name: 

6.9^Dihydro-5H-1-thia*5.8,9-tiiazeh 
cyclopenta[a]azuten-4-one 



AutoNom Name: 

3.1&-Dmydro-4H-[1 .4]diaz6pino 
[5.6-b]indol-5-one 



AutoNom Name: 

3,6-0Ihydro4H-[1 .4]diazepino 
[6.5-b]indol-S-one 




AutoNom Name: 

7.8- Oihydro-1 H-1 .7, 10-triaza-cycloh 
epta[e]inden-6-one 

AutoNom Name: 

8.9- D(hydro-5,6,9,11-tetraaza-cyclo 
hepta[b]naphthafeiv1 0-one 




AutoNom Name: 

8.9-Dihydix>-3l-l-3,6,9-triaza-cycloh8 
pta(e)inden-10-on8 




AutoNom Name; 

3,4-Oihydro-[1 ,4]diazepino[5.6-b]qu 
inolIn-S-one 




AutoNom Name: 

7.8-Dihydro-1 H-1 ^,8-triaza-cyclohe 
pta[f]inden-9-one 




AutoNom Name: 

8,9-Dihydro-4.8, 1 1 -<riaza-cyclohept 
a[a)naphthalen-7-one 




AutoNom Name: 

11H-10.11-Diaz8- 

benzo[b]fluorene 



COjH 

AutoNom Name: 
a-hydroxyacids 



CO^H 

AutoNom Name: 
a-aminoacids 



R1 

AutoNom Name: 
cohels 



AutoNom Name: 
Bicydo[Z2.2]octane 



c 



& 



AutoNom Name: 
2-Mettiylene-2,3-dihydro- 
benzo[1,4]dtG)dne 



/3 



AutoNom Name: 

6,7-Dihydro-2H-pyTazlno 
[1.2-a]f:^midine 




AutoNom Name: 
9H-Ftuorene 




AutoNom Name: 
1 .4-Diaza-bIcycloI2.2.2]octan8 



AutoNom Name: 
1-Aza-bicyclo[2.2.2]octane 



AutoNom Name: 
Pyrido[2.3-d]pyrimidine 
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AutoNom Name: 

&-Methyl8nd-1 .5-dthydro 
-pyrnol*2-on6 



, N 



J. 



AutoNom Name: 

Benzo[4,5]imkfazo 
[1^-a]pyrimidlne . 



AutoNom Name: 

1.4-D(KydfD-benzo[4,5] 
imidazoll »2-a]pyrimidtne 




N 

N^N 

AutoNom Name: 

4.1 0-Oihydro-1 .4a.10-triaza 
-phenanthren-9-one 



AutoNom Name: 

1 ,5-Oihy(Mmidazo 
(1 ;2-a]pyTunidrn-2-one 



O 

AutoNom Name: 

1 .2>3.5-TetrahydrD-imidazo 
[1,2-a]pyi1m{dine 



dp 



AutoNom Name: 

TMazolo[3.2>a]thleno 
[2.3-d]p^mldln-5-one 




AutoNom Name: 

1.d^im{a-4a,i0^aza' 
cyclop8nta(b]nuoren-4-one 




ccP 



AutoNom Name: 

5.6-Dlhydro-1 -thia-5.7.8.9a-tetraaz 
a-cyclop8nta[a]azuten-4^e 



AutoNom Name: 

6.10-DmydrD-5H-1-thla-5,7.10a-tria 
za-benzo[e]azulen-4-one 



AutoNom Name: 

4.5-Oihydro-3-thIa-4.5a.1CMrlazB-C 
yclopenta[a]fluorene 



AutoNom Name: 
8H-1 -Thia-cyctopenta 
[apndene 




AutoNom Name: 

3-Tht&4,5a*lO-(rlaza- 
cyclopenta[a]fluorane 



AutoNom Name: 

6,7,9,11-Tetrahydro-104hia-6,9-dia 
za-lndeno[1 .2-a]azulene-5,8-dlon8 



AutoNom Name: 

2,3.6 J, 1 2.1 2a-Hexahydro- 
pyrazinoll '.2': 1 ,6]pyrldo 
(3.4-b]{ndole-1.4-dione 



AutoNom Name: 

S,10-Dihydro-4H-2.3a,10- 
triaza-cyciopenta[a]fiuorene 




AutoNom Name: 
5H-Pyrtdo[4.3-b]{ndole 



AutoNom Name: 

11H-lndollzlno[1,2-b] 

qulnolln-d-one 




AutoNom Name: 

1 .2-Dthydro-2.4a.9-triaza 

-anthracene-3, 1 0-dfone 




AutoNom Name: 
6H-lsolndolo[2»l-a]lndole 
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p 




p 





AutoNom Name: 
1 .5-DihydfO-benzo[b] 
[1.4]d]a2epln-2-one 



AutoNom Name: 

5.10-Dihydro-dlben2O 
Eb,e]I1 ,4]diazepin-1 1-one 



AutoNom Name: 

5.1 1 'Dihydro-benzo[e]pyi1do 
(3.2-b][1 .4]dlazep{n-6^ne 



AutoNom Name: 
4,9^ihydro-3-thia-4.9'diaza 
-benzo[Qazu)en-1 0-one 






S-^N 



AutoNom Name: 

Benzolg] 
quinoxallne 



AutoNom Name: 

Pyrazino[2.3-b] 
quinoxaane 



AutoNom Name: 

PyridoI2,1-b]quinazolin 
-11 -one 



AutoNom Name: 

1 -Thia-4a,9-dia2a-cyclopenta 
(b]naphthaleiv4-one 



ax 



AutoNom Name: 

2-Methytenfr4H-toenzo[1.4] 

thlazln-d-one 



TABLE I 



In an exemplary embodiment of the invention, the moieties are Me (methyl), Et (eteyl), 
Pr (propyl), Ph(phenol), CO2H, OH and NH2- Although the moieties may be connected at any 
of the R locations, not all the possible gauges are needed, as explained above. The indoUzine 
scaffold can have, at Rl, either COOH or NH2, both of which are shown in the table. In 
particular, applicants have found that in general, a scaffold with four or five attachment points 
can span its entire range of triangles with M moieties, using only about different gauges. 
This is believed to be generally true (e.g., the exponent is not much higher than 3) for scaffolds 
witii a larger number of attachment points. 

It should be noted that even if a library does not cover all the possible triangles, a 
viable reconstruction is still possible for many drug targets and/or considerable utility attached 
to the library. Also, as noted above, partial reconstruction is useful in some cases. Also, as 
noted above, gauge matching can be used as leads and/or to reject leads, even if no 
reconstmction is possible, in some cases. In some embodiments of the invention, a failure of 
the method is typically self-evident and does not create an unproductive search after non- 
existent leads. 

Alternatively to constructing a library of gauges fi^om scratch, at least part of the library 
can be generated by scanning existing libraries for molecules that include triangles having 
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desired sizes and/or moieties. Optionally, molecules that are small and rigid are selected, as 
described above. This type of library, for example, may not be based on a set of scaffolds. 

14.4 LIBRARY BUILDING METHOD 

From the above description, it should be clear that there are many methods that may be 

used to construct a Ubrary. The following exemplary method is described, at least partly to 
illustrate various applications of the above rules: 

(a) determine Ubrary parameters: e.g., spanning range and accuracy desired for library; 

(b) select moieties for libnuy, 

(c) select a scaffold; 

(d) generate gauges from the scaffold; 

(e) add generated gauges if they are suitable; 

(f) repeat (c)-(e) until the library spans the range with a desired accuracy and/or 
coverage; and 

(g) optionally, check Ubrary. 

Ih accordance with example embodiments of the invention, a resource allocation 
algorithm is used, for example the greedy method or the first fit method. These names refer to 
methods of selecting from a set of possible resources, which resource to allocate at a particular 
time, for example, which gauge to choose for a library from available gauges on a scaffold or 
which scaffold to add to the library. Many such methods are known in the art and may be used, 
noting that the method is not required, in some embodiments of the invention, to provide an 
optimal solution, just a working or reasonable solution. 

An alternative method is a selection-based Ubrary construction metiiod. Txi this method, 
existing molecule libraries are scanned for molecules that have gauge-like properties (e.g., as 
described herein). The resulting potential gauges may be filtered out to remove redimdancies. 
It is expected however, that in the current state of pubUc Ubraries, scanning such libraries will 
not yield a complete gauge Ubrary. Optionally, such a selected gauge Ubrary will be completed 
using other techniques, such as scaffold based gauge generation. 

It should be appreciated that given a large number of possible gauges and a smaller 
actual required number, there are many optimization techniques for selecting a suitable and/or 
optimal set of gauges that meet the required number. As noted above, the selection may be 
based on the use to which the Ubrary is put and/or be based on considerations such as diversity, 
chemical behavior and abiUty to synthesize. In addition, a part of a Ubrary may be Tq>laced, for 
example with a set of gauges constructed from other scaffolds or usmg molecules selected 
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from a library of potential leads. la an exemplary embodiment of the invention, at (g) a 
constructed library is optimized, for example, removing redimdancies and ensuring that desired 
distributions (e.g., of triangles, chemical properties) and overlaps (e.g., of lengths and/or 
moieties) meet certain guidelines and/or are optimal. 

5 14.5 SCAFFOLD SELECTiON METHOD 

In an exemplary embodiment of the invention, scaffolds in general are selected to have 
certain desirable properties, for example, one or more of: 

(a) small size; 

(b) rigidity; 

1 0 (c) suitability for combinatorial chemistry; 

(d) including a plurality of attachment points, for example, 3, 4, 6, 10, 12 or any 
smaller intermediate or larger number, for attaching moieties and/or chemical markers (e.g., 
for binding assays, chemical manipulation); 

(e) a geometric arrangement of the attachment points so that a range of triangle sides 
1 5 can be provided; 

(f) 3D structure, for example planar or volumetric may be preferred for different 
situations; 

(g) number of excess protrusions (in some cases may be desirable to be small), to 
which moieties may or may not be attached, so that excess is relative to a perfect scaffold 

20 where the useful (e.g., for the library or for a particular triangle) moieties define the shape of 
the scaffold; and/or 

(h) solubility (may be determined, for example, based on the mmiber of polar atoms in 
the scaffold). 

In general, as more attachment points for moieties are provided, the scaffold is more 
25 able to provide triangles of various sizes, however, this may adversely afTect the scaffold (and 
gauge size) and many of the triangles may be useless. In a scaffold in general, it may be useful 
to designate only some of the potential attachment points as attachment points to be used. This 
may reduce the nimiber of different synthesis methods used and/or promote uniformity thereof. 
Not all or even any of these properties are essential in some embodiments of the 
30 invention. As a practical matter, small rings and ring chains appear to meet these criteria. Thus, 
in an exemplary embodiment of the invention, a set of scaffolds may be generated by 
reviewing existing known rings and small chains for molecules that meet the desired criteria. 
In an exemplary embodiment of the invention, during this type of selection an effort is made to 
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select scafiTolds having a range of sizes (e.g., distances between attachment points), so that a 
range of triangles may be generated using the scaffolds. 

In addition to scaffold criteria in general, a selection of scaffolds for a library may 
impose other criteria, for example that the scaffolds generate a spanning library of gauges 
5 and/or a range of chemistries and/or require a relatively small number of relatively low 
complexity process to generate the gauges. 

In an exemplary embodiment of the invention, the scaffold selection process is as 
follows. Given an existing library portion, a new scaffold is selected from a Ust of available 
potential scaffolds if it answers at least one of flie following criteria: 
1 0 (a) the scaffold generates a large number of triangles that are missing from the libraries, 

for example, 10, 50, 100 or any smaller intemiediate or larger number, such as a user set 
number; 

(b) the scaffold generates at least one (or a small nxmiber of triangles, such as less than 
20, less than 10 or less than 5, or any other user set value) triangles that have evaded 

1 5 generation using other scaffolds and form missing portions of the library, 

(c) the scaffold has a significant amount of known chemistry (e.g., methods for 
manipulation and/or adding moieties); and 

(d) the scaffold adds the potential for a desired amount of overlap. 

In general, if a larger the number of gauges is produced, it may be easier to complete a 
20 library. However, not all scaffolds can generate large nimibers useful triangles. 

It should be noted that in some divergence based methods of Ubraiy design, each 
Ubrary element is selected to be as different as possible, so that this type of selection methods 
and/or at least some of the criteria used cannot be applied and run against conventional ideas. 

It should be noted that as the library fills up, consideration (b) may be given more 
25 weight, with the possibiUty of searching or constructing a scaffold that has the desired 
properties (e.g., to form required triangles). Further, the search may lead to selection of less 
rigid scaffolds, for example, to ensure coverage or due to lack of suitable more rigid scaffolds. 

In an exemplary embodiment of the invention, during an optional optimization stage of 
the library, scaffolds are assessed as to their quality (e.g., meeting scaffold criteria), number of 
30 triangles generated and/or imiqueness of triangles generated. A scaffold may be removed from 
the library if it is determined to be less useful or unneeded based on one or more of these 
considerations. 
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One difiTerence between scafiblds is fhe number of rings in a scaffold. In general, as the 
number of rings increases, so does the scaffold size and weight. For some ^plications, the 
numb^ of rings in a scaffold may be used as a heuristic to determine what approximate 
triangle sizes the scaffold can provide. For some applications, multi-ring scaffolds may be 
5 necessary. Alternatively or additionally, single or bi-ring scaffolds may be useful for small 
triangle sand/or for reducing steric clashes. 

14.6 GAUGE SELECTION METHOD 

In an exemplary embodiment of the invention, gauges in general are selected to have 

certain desirable properties, for example, one or more of: 
10 (a) small size; 

(b) large numbers of triangles; 

(c) high or otherwise desirable binding aflfinity, for example in the range of 1-100 
micro Molar; 

(d) rigidity; 

1 5 (e) ttie attached moieties defining the volume of the molecule; 

(f) relatively uniform binding probability for all moieties, for example a factor of ten 
between moieties and a factor of 100 between molecules in a library, however, in other 
embodiments other, smaller or greater factors (e.g., -1, 5, 20, 50, 130, 250, 1000 or any 
smaller, intermediate or greater factor) may be provided for one or both criteria; and/or 

20 (g) chemical behavior, such as (i) solubility, for example in a natural solute of the 

target (or an approximation thereof), for example water at a given pH, with some detergent 
such as DMSO to aid solubility, (ii) lack of reactivity with expected contaminants, (iii) lack of 
chemical reactivity (creation of covalent bonds) with a target protein i.e., with amino acids or 
known typical combinations of them and/or with a substrate, (iv) desired behavior over a range 

25 of properties. 

In general, a high^ imifonnity of binding means that the assays have a same meaning. 
However, it is generally not practical to provide such narrowly defined materials, and a certain 
latitude is useful if a realistic set of chemical is to be provided. 

When generating a library (or part thereof) by selection of gauges from existing 
30 molecule screening libraries, each molecule is, for example screened against the desired 
criteria. A molecule may be selected or rejected. Alternatively or additionally, a molecule may 
have a score of suitability associated with it. Similarly, a set of potential gauges may be 
generated from scaffolds. 
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In an exemplary embodiment of the inyention» gauges are selected &om the 
generated/selected set, based on one or both of suitability (e.g., relative or absolute) and 
meeting of group criteria. In an exemplary embodiment of the invention, one or more of the 
following group criteria are ^pUed, for example as binary criteria or as part of a score: 
5 (a) That uniqueness of the triangles provided and/or them matching missing triangles. 

(b) Matching of flexibility of the gauges and/or individual triangles, to desired 
flexibihty. 

(c) Shape of gauge as a whole, for example, being elongate or being round. The shape 
may be a consideration, for example when building a library in which shapes are varied so that 

10 steric-clashes will not reject all of a certain triangle. To this end, the shape of the gauge may 
interact with the location of specific triangle son the gauge, e.g., if a same triangle is found on 
two elongate gauges, it may be desirable that on one of the gauges the triangle is in an axial 
direction and in the other, in a trans-axial direction, Altematively or additionally, shape 
considerations relates to the three-dimensional shape of the gauge and/or relative layout of 

1 5 triangles in the gauge. 

(d) That certain non-triangle measures are found, for example specific non-triangle 
measures or that a uniform (or other) distribution of such 4- 5- or other multi-point measm-es 
are provided. 

It should be noted that for gauges and/or scaffolds, the detemwiation of suitabiUty may 
20 include, for example one or more of using simulation and molecular analysis software, 
chemical laboratory testing and/or searching literature for the same or similar chemicals. 

The above selection method may be useful when designing a single universal Ubrary 
(or a set of such libraries for broad uses). It should be noted however, that some, similar or 
other selection methods may be used when generating personal and/or ad-hoc libraries, 
25 searching for gauges or measwes with particular properties and/or when defining a gauge 
and/or scaffold to be generated. 

14.7 GAUGE SYNTHESIS 

The generation of a gauge library fi-om scaffolds, in some embodiments of the 

invention, may 2issist in the serial synthesis of the gauges. In libraries that are not (or are 
30 partially not) scaffold based, standard synthesis methods may be used. 

In an exemplary embodiment of the invention, the gauges are synthesized, for example 
using Uquid phase methods as described below, and impurities are removed using standard 
methods, for example using HPLC. 
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111 an exemplary embodiment of the invention, a parallel synthesis method is used, in 
which a plurality of gauges are synthesized at once and then separated. It should be noted that 
in some embodiments of the invention, only a small number of the gauges that can be created 
by a scaffold are actually needed. Altematively or additionally, even if many of the particular 
5 gauges cannot be created, a sufiScient number of alternative gauges may be available, to 
provide spanning and/or overlap of a desired triangle space. For example, on a five point 
scaffold with 10 moieties, 100,00 combinations are possible, of which 1000 are sufficient 
cover all the triangles. Thus the choosing can be, for example, ad hoc, such as based on the 
actual yield (e.g., relative yield) or based on the prior design of the library. 

10 Li an exCTiplary embodiment of the invention, combinatorial chemistry methods are 

used to attach moieties, each at a different attachment point of a scaffold, optionally so that all 
combination of moieties are created. Each final compoimd is made attached to a polymer bead 
(for example) for ease of separation. The beads may be color coded for assistance in separation 
and/or identification of the created gauge. 

15 Altematively, other soUd phase methods, for example as described below or as known 

in the art, are used. 

14.8 MiXED LIBRARY DESIGN 

As noted above, in order to be useful, a complete imiversal library is not required. 
Further, a gauge library may be included into a "regular" screening library. In an exemplary 

20 embodiment of the invention, at least 0.05%, 0.1%, 0.5%, 1%, 5%, 10%, 20%, 40% or any 
smaller, intermediate or larger percentage of the molecules in a library used for screening, 
measuring and/or other uses comprise gauge-like molecules. Of such gauges, for example, less 
than 50%, or greater than 30%, 60%, 80%, 90%, or any smaller, intermediate or larger 
percentage of the gauges are scaffold-based gauges, where a scaffold is used to generate at 

25 least 5 gauges with less than 20% overlap in triangles defined by attached moieties. As noted 
above, while a library may include standard screening parts, providing significant numbers of 
gauge-like molecules may assist in applying the methods described herein. 

In an exemplary embodiment of the invention, the library comprises at least 5,000, 
10,000, 20,000, 50,000, 80,000 ,100,000 or any intermediate or greater number of gauges. 

30 These gauges may be, for example, scaffold based gauges, plain gauges and/or rigid gauges. 
These gauges may span, for example, 5%, 20%, 40%, 80%, 100% or any smaller, intermediate 
or greater percentage of the triangle space, for example, with an overlap of 1.1, 1.5, 2, 3 or any 
smaller, intermediate or greater degree. As noted above, when spanning is better, the degree of 
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success may be higher, albeit at a cost of using a larger library. Smaller libraries may be easier 
to apply and still yield useful results, in many cases. 

One significant difference between gauges and other lead libraries (e.g., diversity based 
libraries), in accordance with some exemplary embodiments of the invention, is that a 
5 relatively large number of matches is expected using gauge based libraries. For example, at 
least 0.01%, 0.05%, 0.1%,0.2%, 0.5%, 1%, 3%, 5%, 10% or any smaller, mtermediate or 
greater percentage of numbers is expected to bind. The percentage of binding may depend, for 
example on the ratio between gauges and non-gauge leads in a library. 

It should be appreciated that these percentages are not mere numbers. Rather, they 

10 represent a qualitative difTerence firom Ubraries where more often flian not, no leads bind. The 
greater the probability of finding one or more leads and the greater the number of leads, the 
more likely it is that a drag will be fotmd. However, of binding is too likely, the quality of 
information provided by the binding may be reduced. 

A library may also include a mix of three-point measures and higher valance measures. 

1 5 While any gauge that includes more than three moieties includes a high valance measure, in an 
exemplary embodiment of the invention, the library is designed to span the higjier valance 
space. For example, the library spans at least 0,1%, 0.3%, 0.5% or at least 1% or any smaller 
intermediate or larger percentage of the space of the higher valance measures. The spaiming 
may be, for example, continuous (e.g., the whole library at a low resolution or part of the 

20 hbrary at a high resolution) or it may be discrete (e.g., isolated parts of the library), hi general, 
higher valance measures may require a very large number, for example, 20,000,000 for a 
spaiming equivalent to the 100,000 library of the triangles, so commercial implementation may 
depend on the availability of even more parallel assays than available today. Optionally, the 
higher valance measures are provided to be more flexible, so that a lower resolution is required 

25 to span the space. 

14.9 ENSURING LIBRARY RELIABILITY 

hi an exemplary embodiment of the invention, once a library is constructed and/or 
during its constraction, various quality assurance processes may be employed. In one example, 
the library is analyzed to ensure that it meets the spanning, overlap and/or accuracy criteria set 
30 for the library. Any missing triangle and/or gauge may be provided at this point or noted as 
missing. Altematively or additionally, molecules with low solubility or high toxicity are 
removed and/or replaced with molecules exhibiting similar spatial chemical configurations. 
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In an exemplaiy embodiment of the invention, feedback from use of the library is used 
to calibrate the library, reconstruction process and/or to assist in library design. 

In an exemplary embodiment of the invention, the theoretical modeling of the library is 
compared to its actual behavior, for example, by running test assays against randomly selected 
targets having a known and/or an unknown structure. Two examples of molecules with known 
structures are thoroughly mapped proteins and stmctures constructed from DNA or RNA, with 
optional attached elements. Optionally, the targets are not random and are selected to test 
certain assumption in the theoretical model of the library. Alternatively or additionally, the 
calibration is provided by analysis the results of real uses of the library over time. 

In an exemplary embodiment of the invention, one or more of the following data is 
provided by such analysis: 

(a) assay binding rates for gauges and families (e.g. similar) gauges; 

(b) dependency between envirormiental conditions and binding rates and/or conformal 
changes for one or more gauges; 

(c) Baysian probability of steric clashes between gauges (and triangles thereof) with 
overlapping triangles; 

(d) actual degree of overls^ between triangles; 

(e) dependency between target type and gauge binding; and/or 

(f) parameter values (e.g., thresholds) for the various algorithms. 

Other properties of the library, for example genial rigidity of the gauges and 
correctness of values in the data bank may also be provided by such or other analysis. 

In an exemplary embodiment of the invention, as a result of the above findings, the 
library is amended, for example, by removing redundant gauges and/or searching for gauges to 
generate the missing triangles. 

Alternatively or additionally, as a result of the above findings, later generation of 
libraries and sub-set Ubraries is modified to take the calibration information into account, for 
example in a specific manner as relating to specific gauges and/or in a general manner as it 
relates to statistical deviation of the behavior of scaffolds and/or families of gauges from their 
appropriate theoretical models and/or as parameters for such models. 

Altematively or additionally, the reconstruction process is calibrated, for example to 
better distinguish which triangle matched, the actual coverage of each triangle, the spatial 
shape (in triangle space) of a match and/or the relative binding strength of various triangle 
measures and/or gauges. 
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14.10 HUMAN INTERACTION DURING LIBRARY DESIGN 

The process of designing a library may be automatic, semi-automatic or manual, in 
general, when more potential gauges and/or scaffolds are available and suitable modeling 
software is available as well, automated designing may be provided, one example of this is 
5 once a complete library is available, selecting a sub-set may be completely automatic, once the 
desired parameters are provided. Some of the Ubrary may be generated automatically in any 
case, for example selection of gauges from existing libraries and/or selection of scaffolds from 
existing libraries. The determination of ease of synthesis may be required to be manual if no 
earlier information is available. It is noted, however, that in an exemplary embodiment of the 

10 invention, the scaffolds are chosen to have known chemical behavior and synthesis paths, so 
that attachment of moieties should require little or no research work. In some cases, however, a 
human may be required to not only select between alternatives but actually to find a particular 
missing gauge or suggest a scaffold design. It is noted, however, that the mathematical 
description of the Ubrary in accordance with some embodiments of the invention^ assists and 

15 may allow complete or nearly complete automatic generation of a library using constructive 
synthesis and/or analysis of existing molecules. Possibly, such a library may then be 
optimized, for example as described above, possibly manually, especially to assist in providing 
an easy to synthesize library. 

As noted above, the reconstruction process may be completely automatic or it may 

20 include a manual aspect, hi general, however, it is expected that the high hit rate of binding of 
gauges will reduce or eliminate any need for human intervention, at least in some of the steps 
of dmg discovery. Of course, once mapping is completed, a human user may desired to test the 
effect of various assumptions, for example, how the reconstructed layout depends on various 
assumptions made on the target conformity. Also, in some case a human expert (or an expert 

25 system) may be used to select among alternative or select likely leads, since in many cases the 
method will generate a small number of possibiUties from which one or two should be 
selected, failing that costs may be very high. 

Li an exemplary embodiment of the invention, one point for human intervention in the 
dmg discovery process is in designing drug candidates that match a jBnal pharmacophore (e.g., 

30 model). It is noted , that various software exists to assist or automate this step. Typically 
however (at this point in time), human judgment is better at assessing syntiietic feasibility for 
complex molecules. If the suggested dmgs are created by linking together gauges or simple 
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fiagments, however, automatic assessment and possibly generation methods, may be 
reasonable. 

15. EXPERIMENTS AND EXAMPLES 



15.1 EXPERIMENT 1 

5 Some of the above measurement method was testing using the following experiment. 

In this raperiment, known inhibitors of HIV-1 Protease were analyzed to detect a set of 
triangle measures that should exhibit binding to HIV-1 Protease. A set of molecules including 
the triangle measures were selected and physically assayed and shown to have the expected 
binding to HIV-1 Protease. The results indicate that triangles are a viable geometrical sub- 
1 0 structure that can be used to measure a target by binding. 

The following entries in the PDB (Protein Data Base) were extracted as structures of 
HTV-l Protease with known, bound, inhibitors: lajv lajx Idif Igno Ihbv Ihih Ihos Ihps Ihpv 
Ihpx Ihsg Ihte Ihtf Ihtg Ihvi Ihyj Ihvk Ihvl lohr Isbg lupj 2bpv 2bpw 2bpx 2bpy 2bpz 2\spj 
3tlh 5hvp 7upj. 

15 The structures were super-imposed using the protein as a reference fiame, so that the 

spatial position and orientation of the inhibitors was superimposed. The inhibitor molecules 
were then decomposed into moieties and those were clustered in space. Strong bonding 
locations were identified based on the same moiety in different molecules binding to a 
substantially same binding location in the protease. Confidence in these locations was 

20 increased by verifying that the protein moieties at those locations were compatible with the 
inhibitor molecule moieties. 

Triplets of the inhibitor moieties at the strong binding locations were selected as 
*triangles'\ Gauges, for example, of a gauge set as described above, that have those triangles, 
are expected to bind, or at least some of them should bind. 

25 The triplets were used as a query input for a search in MDL's ACD-SC (available 

chemical directory for screening). Molecules that matched the queries (moieties and size) and 
the rigidity requirements were selected, as shown in the following table. 



No. 


Compound 


MW 


Density 


Cat. No. 


mg for 








(g/ml) 




ImM 












in lOmi 


1 


---^ 


276.35 




S-83425-4 


2.8 
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32 




295.43 




RJC-03631 


3.0 


33 




212.25 




RJF-00720 


2.1 



34 


OMe 


^OMe 


302.41 




85,612-6 




35 


oic 

o 




252.23 




25,272-7 




36 




267.54 




29.126-9 




37 


F 

kV 


OH 
O 

-F 


258.12 




23 319-6 




38 


CH, 




265.94 




30,118-3 




39 


HN^NH, 


308.34 




16,263-9 





TABLED 

The molecules numbering up to 33 were expected to exhibit binding behavior, due to 
5 them including at least one triplet. The molecules numbered 34 and up are superficially similar 
but do not include the required triangles. 

All of molecules were actually assayed and appeared to show activity (effect on HIV-1 
Protease) at various concratrations (between 10 and 1000 micro-molar). Of these molecules 1- 
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33 about 60% were found to be active, in particular molecules 7, 9, 23 and 27. Also molecules 
34-39 were assayed, with no activity shown, as expected. 

As noted above, these results appear to indicate that gauges, in general, that have a 
triangle measure that matches the target layout, should, often enough, bind in a detectable 
5 manner. 

15,2 EXPERIMENT 2 

In this experiment, assay results performed by others were used to reconstruct the 

spatial layout of binding locations, for known molecules and then compared to the current state 
of the art. 

10 The NCI maintains a database of molecules that have tested positive for activity against 

HIV. 43,000 results (in the October 1999 release) are available at "http://dtp.nci.nih.gov", 
under •*pubUc data", then "results from AIDS antiviral screen". From these molecules were 
selected a subset that showed at least a moderate level of activity and were rigid enough to 
allow determination of the spatial position of all their moieties. This resulted in fewer than 200 

1 5 molecules. The moiety triangles in these selected molecules were clustered. 

The clustering results showed a good match to the results of experiment I and the 
triangles of the molecules were found in the PDB structures. 

These results appear to indicate that a set of gauges (e.g., the molecules that were tested 
for HTV) can be used to measure and then reconstruct an active area. 

20 In addition, these results appear to indicate that at least part of a suitable library may be 

generated by selecting suitable gauges from available libraries, rather than by constmction 
using scaffolds. It should be appreciated that it may not be required to determine the spatial 
positions of all the moieties, for example only of the moieties with a high binding affinity. 
Moieties with low affinities may be removed, in some cases. 

25 16. SYNTHESIS BOOK 

Following is a synthesis book, arranged in chapters, for some of the scaffolds (and 
gauges derived from them), shown in table I. A most important aspect of this synthesis is that 
it illustrates that suitable scaffolds and gauges are available and can be generated using known 
chemical processes applied to standard or modified sources and/or by changing their 

30 parameters in an expected manner. The references described in this book are incorporated 
herein by reference. In any case, the partial library described in the appendix has at least the 
property that is can serve in many cases to provide a partial reconstmction and/or a significant 
increase in lead matching. 
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It should be appreciated that the novel materials described in the book, the 
manipulation methods thereof, synthesis methods thereof and groups of molecules from this 
book are also considered to be within the scope of at least some aspects of the invention, for 
example, a library including one, two, four, six, eight or any intermediate number of scaffolds 
5 as described therein. Altematively or additionally, a library in accordance with an exemplary 
embodiment of the invention, includes at least 100, 300, 500, 1000, 2000, 4000, 10,000, 
20,000 or any smaller, intermediate or larger number of gauges from this book. While it is 
useful to select gauges from the book, for example by using the scaffolds described therein to 
span part of the library, this is not required. 
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16.1 Benzenes, Pyrimidines 6-membered ring scaffold 

The Biginelli dihydropyrimidine synthesis (pathway below) is a promising multi 
5 component condensation, which involves the one-pot cyclocondensation of P*ketoesters 2, 
aldehydes 3, and ureas 4 providing the heterocycle 1 , which can be oxidized to the 
corresponding pyrimidine moiety. 

O^H 

R3^N^R4 R3^N^R4 R^^O "^N R4 



Biginelli-general multicomponent approach. 

10 

Several protocols have been developed for solution phase Biginelli reactions^ In order to drive 
the reactions to completion, however generally, an excess of two of the three components 2-4 
has often to be employed, and purification steps are required. The solid phase synthesis 
provides the desired dihydropyrimidines in good yield and superior purity directly after 
1 5 cleavage fix>m the resin ^ Q)athway below) : 




GABAuma Wang rosin 



TFA,DCM 
67-98% 




Another approach for the SP synthesis of higjily substituted pyrimidines was recently 
published^. In this work the synthesis starts from polymer-bonded thiouronium salt 5, which 
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iind^goes cyclocondensation with acetylenic ketones 6 to fonn carboxy pyrimidiiies 7 
(pathway below). 



CI " Ni^ >^ 2 



5 6 



i 



COjH 



R 

7 

Te;tra substituted pyrimidines a can be prepared via a modified Bigeneili's synthesis as 
described in the pathway below: 




First the imidine functionalities are formed on the acid labile resin to produce the resin 
immobilized amidines 23^, urea 24 and guanidine 25^. Actually, these amidines served as the 
first Biginelli building block. Next, the addition of the two other Biginelli building blocks, 
namely 2 and 3, to 23-25 leads to generation of dihydropyrimidine scaffolds 20,21 and 22^* 
respectively. The consequent reduction of ketone moieties (NaBILj, BF30Et2) leads to 14, 15 
and 16, which after cleavage (TFA, DCM, 1:1) followed by mild oxidation (CAN, CH3CN) 
affords the desired pyrimidines 8, 9^ and 10 respectively. The CAN could be removed, after 
the completion of the oxidation, by Solid Phase Extraction (SPE) or by sunple 96 well SePack. 
Other oxidation reagents such as MnOz^, O-chloranil ^ KMnO/, and CrOa, AcOH, H2SO4* 
can also be used. In case R3 = OMe (when the building block 2 is p-ketoester ) 
dihydropyrimidines 20, 21 and 22 undergo hydrolysis of ester (LiOH, THF or 5%alcoholic 
KOH^^ producing the 4^arboxy -dihydropyrimidines 17, 18 and 19 respectively. Following 
by the same mode as for 8, 9 and 10 (1. TFA, DCM, 1:1; 2. CAN, CH3CN) 17, 18 and 19 react 
to give the sub-library of 4-carboxy-pyrimidines 11, 12 and 13 respectively. It should be noted 
that in case of unsynmietrical 1,3 diketones 2 a mixture of 2 isomers are obtained. 
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•r 

R2 O 

HO N R,(fy 



20 



21 



R2 O 

juc: 



^ 22 



B.ChenerB.WO 95/16712. 1995 

R2 O R2 



.iX: 



20 



R4^N^R,(RJ 2. CAN. DMA 

•r 14 



UOH (THF) 



1.TFA/DCM 
Z CAN, DMA 



J||^^^^^^ NaBH^. BF,QEt a jf''^^ 




R2 O 



•r 



21 



HO N R^CRj) 
15 



UOH (THF) 



R3-OMe 




N 

HO'^N' 

W 18 

R2 



OH 



l.TFA/DCM 


R2 

HO N^R,(Rj) 

9 


2. CAN. DMA 

l.TFA/DCM 




XI 

12 


2. CAN. DMA 



1. TFACCM 



H^'^N^R,(R,) ZNaBH,. BocHN N R,(RJ 2. CAN. DMA 



R2 



HjN-^N^R^CR,) 

10 



UOH (THF) 



RSsQMe 



R2 O 



OH • 



19 





R2 O 


1.TFAA3CM 


N-'W^OH 


2. CAN. DMA 


HjN'^N^RI 

13 



core i^proach towards tetrasubstituted pyriimdines. 
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It was demonstrated'^ that dihydropjrimidine S-carboxylic acid can be transfonned into 
carboxylic azide which in turn xmdergoes Curtius rearrangement to give isocyanate. This 
reaction provide an excess of S-amino dihydropyrimidines A. 
R2 

.R3 



arbuzov 



(Ph)3P<^R2 wittigR,CHO 



P(Ph)3,EtONa d^f^2 EtONa, DMA. reflux 



28 



27 




HN 



R4 
23 



NH, 



DMF. ^0(fiC 
16hr, air 



RIv^^N^ ^R4 



R3 
a 



Pyrimidines can be prepared by cyclocondensation of amidines with a-|3 unsaturated 
ketone. Recently, the researchers have published the synthetic work*^, in which they describe 
the utilization of the Wittig reaction in fomiation of a,p-unsaturated ketones on SP for the 
synthesis of the various heterocycles. We propose the altemative three-step synthesis of 
pyrimidines a in solution, based on the fonnation of the a,p-unsaturated ketone building 



1 0 blocks 26 as a key step is described below: 

a-P-imsaturated ketones 26 can be obtained in good yields and purity by Wittig 
reaction of the appropriate aldehyde and the corresponding triphenylphosphonium bromide 27 
with NaOEt at reflux in DMA. The phosphorus yields 27 are readily available from a-bromo 
ketones 28 by the Arbuzov reaction, followed by treatment with a strong base, such as NaOEt 
1 5 The reaction of ketones 26 with various amidines 23 (Fig 4) affords, the desired tetra- 
substituted pyrimidine sub library a. 

Small sub-libraries b-g having one or more constant functional group on the six 
member aromatic ring, are characterized by better solubility. 
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HO^N^R4 HO^Ny'^^ HO^-^OH 

hOyVn ^A^n 



R1 O R1 R1 

e 



b 



V 



e 



A series of 2,5,6-trisubstituted-4-oxoKiihydropyri]iudines 29 can be prepared by SP using a 
cyclization-cleavage strateg/^ from readily available amidines 23 and resin attached a,^- 
nnsaturated carboxylic acids 30^^ (see pathway below). Compound 30 is obtained via coupling 
of the polymer and acyl-chloride 31 (derivetized from commercially available a,p imsaturated 
carboxylic acids. 



o ^ 



R1 "^23"" , '^^N-^NH CAN.CH3CN ^ '^^Y^'^ 

30 "^02 HCI(33./toluene R2-^N*^r4 R2^n'*^R4 
950c 

29 b 




DCM, 
DMAP 



R1 

31 



OH 



Compounds 29^^*^ can be oxidized (CAN, CH3CN) to corresponding pyrimidines b. 
A solid phase method for the preparation of Knoevenagel condensation products from resin 
bound malonates and malonic acids has potential for the prq)aration of hetero- and carbocyclic 
compounds, (see pathway below) 
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1. Meldnjm's 
add 

-OH 



^ ^ Dloeridlna acetate ^ ^ w^^iijw^. 



2. CF3CH2OH piperidlne acetate ^ |l 

34 R1-^ 33 




Malonic acid monoester (see pathway above) are prepared from macroporous Wang resin 
(AgroPore, Argonaut Technologies)^^ by treatment with Meldrum's acids. 
Conversion of the unsymmetrical ester 34 was achieved by treatment with trifluoroethanol and 
Die, followed by Knoevenagel condensation with the aldehyde in the presence of piperidine 
5 acetate to give substituted methylene malonate 33. For the bulk resin preparation of 33 (2-lOg 
of resin), the Knoevenagel condensations are carried out with Dean-Stark trap to eliminate 
water which gave consistently higher yields and faster reaction). Malonates 33 are treated with 
10 equivalents of the amidine hydrochlorides 23 in dimethylacetamide (DMA) solution, with 
excess K2CO3 to neutralize the HCl amidine salt, at 70°C for 4-8h to give resin bound 
10 dihydropyrimidones 32 . The reagent consumption progress can be monitored by FTIR 
observing the adsorptions of C=N and C=0 groups. Oxidation of 32 with 0.2M eerie 
ammonium nitrate (CAN) in DMA*^ affords resin bound hydroxy-pyrimidines. Cleaviiig under 
acidic conditions (TFA/DCM, 1:1, RT, l-2h) gives secondary sub-library c (The sub-library c 
exists in its tautomeric form- 4-pyrimidone). 

15 

The examples for tailor-made synthesis of miscellaneous tetrasubstituted 6-atom 
membered rings are described below. 

The amidines 23-25 react in solution with commercially available 
[bis(methylthio)methylidene]malononitrile 35 (see pathway below) in the presence of DIEA*^ 
20 to give the corresponding methylthiopyrimidines. The latter are oxidized with L2 equiv. of m- 
CPBA in DCM or to form the intermediate sulfinyl derivatives 36 which are subjected 

to amine substitution with NH3 ( dioxane room temperature)leading, after nitrile hydrolysis 
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(TFPA)*^ to the final aminopyrimidines 37. If LiOH is used instead of NH3 the corresponding 
hydroxypyrimidines 38 after nitrile are obtained. 



1 

24 



H,N NHj 

25 



1 -J 



23 



SMe^_^CN 

/ \ 
SMe CN 



35 



l^cCN. DIEA.O«C-RT 



2. m-CPBA. DCM 



N 
36 



OjMe 
CN 




NH, 



NH, 



1. NH3, .Dioxane 

F 
F 



37 



1. LiOH, Dioxane 
2- F>JY«y< 



38 



5 A series of various 3,4,5-trisubstituted phenols 39 can be synthesized in high yields using the 
"cyclization-cleavage " approach 
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Base catalyzed reactions between a^P-unsaturated ketones and polymer bonded acetonyl 
groups 42 (see pathway above) result in a tandem Michael addition/annulation reaction with 
5 concomitant cleavage from the resin to obtain the desired phenols 39 The synthesis initiates 
using resin prepared from Merrifield resin by coupling with Sodium 3-hydroxypyridine, 
producing higher loading capacity resin 44, which was successfully quartemized by 1- 
bromopropane-2-one (or 2-bromo 1 -phenyl propn-l-one; 2-bromo 1,2 diphenyl ethanone; 2- 
bromo-1 -phenyl butane-l-one; 3-bromo butan-2-one) to afford poly-pyridinium salt 43. 
10 Reaction of 43 with a,p-xmsaturated ketones was carried for 16 h, and after filtration of the 
resin the library 39 is obtained. 
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16.2 lndolo[2,3'b]quinoHne 6,6,5,6 cyclic scaffold 

The indolo[23-b]quinolines la,b synthetic pathway is outlined in the patiiway below. The key 
5 step in this synthesis is the decomposition of the con^sponding triazoles 2a,b in 

polyphosphoric acid (PPA) at 1 10-160*^C, which affords the desired la,b . The isomers 2a 
and 2b can be separated during Purification. The starting triazoles 2a,b can be prepared by 
heating trisubstututed chloroquinolines 3 with benzotriazole building blocks 6a,b at 1 10- 
120®C in presence of TEA. The benzotriazole building blocks 6a,b is prepared from 
10 monosubstituted nitro-anilines by reduction of NO2 group (SnCla or H2/Pd) and subsequent 
diazotization of readily obtained diamines. 




R1 



synthesis of Indolo[2,3-b]quinoline 

15 2-chloro-quinolines 3 is prepared in three steps from disubstituted anilines first the aniUde is 
formed either by reaction with preactivated (BTC, DMAP, coUidine) p-keto-acids, or with the 
free acid at high temperature followed by intramolecular cyclization of 5 under acidic 
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conditions. Finally the obtained quinolinone is chlorinated with freshly distilled POCla^ to 
afford 3. Another approach, namely solid-phase synthesis of la,b, can be utilized using 
disubstituted anilines with solid support attachable functional groups (CO2H, NH2, OH). 

R3 



R1 



RT 




AcOH, SMNP 




POCIj 





solid phase synthesis of Indolo[2,3-b]quinoline 



The starting anilines can be loaded on appropriate resin according to the type of the functional 
group to be attached. If the functional group is CO2H, the resin will be phenolic (see quinoline 
chapter change formulation according with the format of the patent) and the loading is 
performed under esterification conditions (BTC, DMAP); if the functional group is OH, the 
loading can be performed by Mitsunobu reaction; and if the functional group is NH2 the 
starting aniline will be loaded under sulfonation conditions on sulfonyl chloride resin or 
alternatively prepared by Curtius rearrangement from corresponding carboxyl derivatives. 
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f 6.3 Isoindoloindoles and isoindololndolones 6,5,5,6 tetra cyclic scaffolds 

Herein, is described the Pd catalyzed annulation^ to form an isoindoloindole skeleton from 
5 readily prepared imines and internal aryl acetylenes. 

Itnines and disubstituted acetylenes undergo a multistep reaction in the presence of palladiiun 
catalyst to produce iso^ndoloindoles^ which are obtained in good yields (see pathway below). 




^ isoindoloindole 

general schCTie for the preparation of isoindoloindoles 



By using divers building blocks - either mono- or di-substituted iodo-anilines 7, and premade 
di- or trisubstituted phenyl acetylenes 5. 

A large library of isoindoloindolsl-4 can be obtained (see pathway below). 




This annulation reaction comprise of two steps synthesis without isolation of intermediate 
iodoimines 6. The steps of the synthesis are as follows: 
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1. Imine 6 is formed in solution using drying reagents such as TMOF, molecular sieves or 
Na2S04. 

2. The acetylenes 5 are prepared by Heck reaction between commercially available or pre- 
formed di- and monosubstituted iodobenzenes and monosubstituted acetylenes using 
standard Pd catalysts (see pathway below). Modified Heck reaction on solid phase 
can also be used When we use solution phase, the reaction mixture can be used for 
the next step as it, without recovering the catalyst, because the one is required for the 
next step. 




Heck reaction for the preparation of substituted phenyl acetylenes 



3. The annulation of internal alkynes to isoindoloindoles using Pd(OAc)2 in the presence 
of an amine LiCl or BU4NCI in DMF. 

When one of the substituents is at ortho- position, the ring closure will proceed in 

regioselective manner affording single tetra-substituted isondoloindoles 1,3. 

When ortho- position on 5 is imoccupied, some substituents control regioselectivity of ring 

closiu"e by chelating the palladium in the a-palladium intermediate, which is formed during the 

reaction. Other cases the two isomers can be separated by chromatography. 

For generation of 1 1 -hydroxy isoindoloindoles: TMS protected hydroxyalkyne 11 can be 

utilized, generating after TMS removal (n-Bu4NF) 11-hydroxy-isoindoloindole sub-library 12 

(see pathway below). 
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R2 OTMS 



OTMS 




R1 OH 



For generation of 1 l-amino-isoindoloindoles, carboxyalkyne 5 can be used for preparation of 
11-carboxy-isoindoloindoles 1-4. The last can be converted to corresponding azodocarbonyl 
14 (w-BuOCOCl, then NaNa)*^*^ (see pathway below), which can undergo rearrangement 
through nitrene intermediate to provide desired 1 1-amino-isoindoloindole sub-library 13. 



R2, 



R1 



R3. 



R4 



Cp^Et 



Pd. cat 




R1 COjEt 




reflux 




13 



Constant polar functional group can be added such as guanidine. The most convenient location 
for this purpose is the para position on phenyl ring derived from imine 10 (see pathway 
below). The imine 10 bears Bpoc protected amine group, which can readily be deprotected, 
after annulation with appropriate alkyne, to give 9. Amino isoindoloindole 9 can react with 
bis-Boc thiourea*^ (HgCb, TEA) to obtain, after subsequent deprotection (TFA/DCM), the 
fimal library 8. 
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1 6.3. 1 Isoindoloindolones 



A slightly modified isoindoloindolone scaffold (see below) can be prepared by two systematic 
routes: 



R2 








N 




O 


R3 



R4 



A schematic description is shown in the pathway below: 
The approach presented above is divided to three major steps: 

1. Formation of di- or tri-substituted indoles: via — Heck reaction between an acetylene 
and iodoaniline 

2. Benzoylation of indole ring with or/Ao-iodo-benzoyl moiety. The coupling of 
disubstituted ortho-iodo benzoic acid BB to indole 18 can be carried out in to ways: 1. 
Regular coupling of BB to indole using DCC/DMAP"; 2. Using a pre formed acid 
chloride*"' 
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i. Cyclization using Pd catalyzed reaction (Heck annulation)^^' The addition is very 
specific using iodo-benzoyl ring. In case the 7* position is not occupied it can add to 7 
position of the indole instead of position 2. This addition gives us a new scaffold, 
which is another library (see pathway below). 




The indole 18 can be prepared by traceless solid phase indole synthesis using indole N-H as a 
resin attachment point which could be cleaved to give the free indole 18. One of the most 
efiScient solution phase methods of indole synthesis is the Pd(0)-mediated reaction of 2-iodo- 
anilines with acetylenes in the presence of base as developed by Larock 
Monosubstituted 2-Iodoaniline, after loading onto the THP resin through an aminal linkage 
using PPTS can give 20 (see pathway below). Replacing the catalyst to Pd(PPh3)2Cl2 and using 
the DCE soluble base TGM, were found to be beneficial in pushing the annulation reaction to 
completion, affording 19. Resin cleavage with 10% TFA then can give the free indole 18. It 
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was found that TMS-substituted acetylenes readily went to completion at 80°C with almost 
complete regiocelectivity. 

The carboxylated 15 (R2 = CO2H) can be converted to amine analog 16 through the 
corresponding azodocarbonyl, which can undergo rearrangement through nitrene intermediate 
to provide desired amino-isoindoloindolone sub-Ubrary. 



n, 8317, 1998 



28 



10%TFA/OCM 



l.PdCPPhaVKOAc 
2. NaN, 



26 



27 



TMS — -R2 



TMG (10 eqidv). DMF. 110^ 
29 2 cycles 



TEA, Td. ref. few hours 



I.PdCPPhjVKOAc 








DMF/110<C 








2. UOH, THF/HjO 


23 


0 . 


R3 







JZ 










O R3 


24 





N,00 




Tel. t-BuOH. ref 



BOCNH 




I TFAmCM 














O 

25 


R3 



X = 0. COa 



preparation of isoindoloindolone 



10 The hydroxy- and carboxy isoindolones 23 (X = O, CO2) can be generated by SP synthesis (see 
pathway above) starting by loading the appropriate iodo-aniline on the resin * and effecting the 
annulation with TMS acetylenes. 

The subsequent benzoylation and annulation of 27 followed by cleavage from the resin affords 
25. 

15 
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A second way of fonnation of isoindoloindolones presented in the following pathway'^: 




o 



A key 



step is an intramoleciilar wittig reaction. Substituted orT/io-allcyl anilines and phtbalic 
anhydride derivatives react to form aryl phthalimides. These can be converted to phosphonium 
5 salts and can be closed to isoindoloindolone system. 
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16.4 The single atom scaffold 

The smallest scaffold used in this implementation is the single atom scaffold, namely one 
carbon scaffold, of the general stmcture a: 




The library a consists of several sub-Ubraries b-e (see below) that represent compounds with 
one constant functional group and independent variety of substituents around the carbon: 



"v-^ "V-^ "'v^ X 



The secondary sub-libraries comprising two or three constant polar functionalities (see below) 
may be somewhat limited, because of the chemical unstability of molecules bearing two or 
three geminal amines or hydroxy! atoms (compounds f-j): 




f g *^ i j 

However, the synthesis of the a-amino acids k, a-hydroxy acids m and a-dicarboxylic 1 acids 
are known. For example they are described in: Synthesis of optically active a-amino acids by 
Robert M. Williams, Pergamon Press. 
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Some of the compounds based on the carbon scaffold are mostly commercially available. 
Those that are not commercially available can be synthesized, mostly in solution, by 
conventional methods. 

The tetriary alcohols b^ can be synthesized through the well known epoxidation of olefins 2( as a 
key step, producing epoxides 1, which already possess the required substituents^ (see pathway 
below) 



=< 



R1 



m-CPBA. DCM 



R2 



1. TFMSOjO, THF, -40oC 

2. NHg/dioxane. 0<C 



O R1 
R2 




LjAIH4 
THF 




Electron-donating groups typically increase the rate. Conditions are mild and yields are high. 
The consequent reduction of epoxides is easily carried out. The most common reagent is 
LiAfflU, which reacts through the inversion of configuration 2^. As expected firom the SN2 
mechanism, cleavage usually occurs so that the desired tertiary alcohol b is formed. Product b 
serves as the starting material for the tertiary amines c, which are obtained firom b by 
substitution of corresponding trifluoromethylsulfonate with ammonia in dioxane. 
The solid phase preparation of the tertiary alcohols b has been recently reported"^. Actually, this 
new cleavage strategy involves addition of carbon nucleophiles to ester bound polymers 3, 



o 



R1 



R2MgX 
THF /Ether 



HO^^R2 



R1 



-OH 



R2 
b 
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By this mode can be prepared only tetriary alcohols with two identical alkyl or phmyl 
substituents (R2), thus, limiting the diversity of the products, but still able to generate rapidly 
the secondary sub-library of the tetriary alcohols. 

The a-hydroxy acids m can be obtained by straightforward one-pot procedure from the 
corresponding a-keto acids 4 (pathway below), a oxo acid 4 are commercially available and 
their treatment with Grigniard reagents (2 equiv., THF, -40**C-RT) lead to the desired m 
products. 

2equiv. R2MgBr 




O 
m 



A SchiflFbase activated glycine supported on a soluble polymer (PEG) 6 can be readily 
alkylated with the wide variety of electrophiles in the presence of carbonate base (CS2CO3) in 
acetonitrile^ providing non-stereospecific amino acid esters. 
Similarly, Schiff base activated amino acids /-Butyl est^s 8 can be alkylated to a-C 
disubstituted analogs 7 (pathway below) using alkyl bromides and the LDA as a base (LDA, 
THF, -40^C). 
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The Schiff bases 8 can be prepared by transimination of the commercially available t-Bu ester 
of amino acids 9 with benzophenone imine. Finally, The alkylated product can be totally 
deprotected by TFA/DCM yielding the desired secondary sub-library k. 
It should be mentioned that all products generated in this chapter are enantio-unselective and 
require separation of enantiomers on chiral column. The utilization of racemic mixtures could 
be also considered 
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f 6.5 Benzodiazepines 6,7 bicycllc scaffold 

Benzodiazepines are fher^eutic and anticonvulsant agents. As such the 1,4 benzodiazepines 
have been the target of several soUd phase synthetic strategies. 

The synthesis of 1,4-benzodiazepines, is based on the closure of a seven membered ring, via 
lactamization in high yield. 

A slightly modified solid phase approach, which is based on the ring closure, via an imine 
moiety is described in the pathway below. 




Solid phase synthesis of benzodiazepines 

According to this strategy the aldehyde resin 1 ^ is coupled to P-amino-alcohol 2 via reductive 
alkylation (Fig 1). p-aminoalcohol (2) can be prepared in two altemative routes (see pathway 
below): 

(1) Coupling of A^-methoxyhydroxamate (8) with Griniard reagents (R2MgBr) to obtain 
the corresponding ketones, followed by reduction using NaBHU (MeOH, rt, few hours) 
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to afford the Boc protected amino-alcohol derivative (9). Removal of the protecting 
group yield 2. 

(2) Reducing A^-methoxyhydroxamate (8) with LiAlHU to the aldehyde derivative followed 
by coupling with Grinard reagents (R2MgBr) to form the Boc protected amino-alcohol 
derivative (9). Removal of the protecting group yield 2. 



BocNH 



R1 Me 
8 



,OMe ^' RzMgBr. THF. rt 
2. NaBH4. MeOH 



BocNH^ >*^..,OMe I.LIAIH4.THF 
R1 Me 
8 



R1 



2. RzMgBr. THF, rt 



BocNH^^ 
R1 



3. HCI, EtjO 



9 

Synthesis of P-aminoalcohol 




The coupling between the aldehyde resin (1) and the amino-alcohol hydrochloride salt (2) is 
1 0 done via reductive alkylation using NaBH(OAc)3, 1 % AcOH, DMF to give the resin 

immobilized p-amino-alcohols 3. To avoid racemization, it is desirable to obtain equilibrium 
between the resin boimd aldehyde 1 and P-amino-alcohols 2 before addition of the reducing 
agent to the reaction mixture. 

Coupling between the secondary amine 3 and Boc protected disubstituted anthranilic acids 4 
1 5 leads to resin bound intemiediate 5. Oxidation of the hydroxy group to affords 6. The 
oxidation on solid support can be carried out by Py.SOs ^ complex in DMSO at room 
temperature, or by the alternative procediure using NMO (N-methylmorpholine N-oxide) 
with TPAP (tetra-n-propylammoniumpemithenate) catalyst, in DMF at room temperature. 
Compoimd 6 is dq^rotected (TFA/DCM), and the free amine undergoes intramolecular 
20 cyclization under acidic conditions to obtain the desired benzodiazepine 7. 
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Introduction of amine or hydroxyl at position 3 of 1 ,4 benzodiazepines will result in 
decomposition of the material. At position 2, an OH group will isomerise to the keto form, 
while an NH2 group can form tautomers with the imine group. 

The synthesic route for the preparation of a benzodiazepine having an NH2 substituent at 
5 position 2 is described in the two pathways below: 

(1) Thioamino ester (10) is loaded onto aldehyde resin 1 by reductive alkylation 

(NaBH(OAc)3, 1% AcOH in DMF) to obtain resin bound intermediate 11 (Fig 3). The 
secondary amines (11) is coupled with disubstituted anthranilic acids (12) (BDC, NMP) 
to form amide 13, which can undergo the intramolecular cyclization using lithiated 
10 methoxy acetanilide (14) ^ to give tfaiobenzdiazepine 15. The cyclic resin boimd 



20 



15 



(2) 



tfaiointermediate IS is submitted to methylation (Mel) followed by oxidation to 
generate preferable leaving group (namely methylsulfoxide) for nucleophylic 
substitution. Such substitution reactions can be operated with acid labile dimethoxy 
benzylamine under standard conditions (16)(DMF, DIE A) providing after acidic 
cleavage the desired 2-amine benzodiazepine sub-library 17. 

An alternative synthesis of 2-aminobenzodiazepine is as follows. Benzodiazepine 2,5 
dione (20) is formed by coupling of substituted anthranilic acid with amino-acid 
followed by ring closure, which reacts with Lawesson reagent to form intermediate-2- 
thiobenzodiazepine-5 one (21). The amine 22 is obtained by reaction between the 
benzodiazepinetfaione 21 and ammonia. 
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synthesis of 2-aminobenzodiazepine 
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NO2 




COOH 



hydrogenation ^ 




Lawesson 
reagnt 

Ri ^ 




alternative synthesis of 2-aminobenzodiazepine 



10 



The synthesis of P-hydroxy a amino- acid, a building block used for the preparation of 2- 
carboxy benzodiazepine is described in the following pathway. Commercially available chiral 
Fmoc serine t-butyl ester 26, imdergoes Swom oxidation ((C0C1)2 , DMSO) to obtain the 
aldehyde 27. The aldehyde 27 is subjected to Gringard reaction RlMgX to form the Fmoc 
protected amino-alcohols, which after Fmoc removal (piperidine, MeOH) leads to desired 
building blocks 28. In case when both Ri and R2 are caiboxyl groups, the starting material is 
di- 1 Butyl fumarate 23, which upon epoxidation (mCPBA, NaHCOa, DCM) gives the epoxide 
24, followed by ammonia in methanol to afford 25. 



COjtBu 



mCPBA. NaHCOs DCM 



COJBu 



NH, in MeOH 



HQ 



COjtBu 



HjN 




COjtBu 



COjtBu 



23 



24 



25 



1--°" Swem ox. 1- Qeq) R,MgBr. THF R1 

I'^/^rNus.. ^.-^^^ 2. Piperidine. MeOH w M^^f 



FmocHN^COjtBu FmocHN-^CO^tBu 2. Piperidine. MeOH HjN-^^COjffiu 

Preparation of b-hydroxy amino acid 



26 27 
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The synthesis of benzopyridodiazepine**'" 33 is described in the pathway below. 2-chloro-3- 
aminopyridines 29*^ is coupled with disubstituted azidobenzoyl chloride building block 30. 
Reduction of the azide 31 with SnCh provides the 2-chlorooxazolidine intermediate 32, which 
upon treatment with acid rearranges to the desired pyridine-based tricyclic scaffold 33. 

a N, 

^^^T^'^Y^"^ ' 9:1.DCM/Pyridlne 

29 31 





5 

preparation of benzopyridodiazepine 

The synthesis of the oxy analog of 33 namely 10/f-Dibenzo[6/|[l,4]oxazepin-ll-one is 
described in the pathway below. Disubstituted 0-aminophenol building unit 35 is attached to 
10 the resin on the Acid sensitive MEthoxy BenzAldehyde (AMEB A)(34) via reductive 
amination , to form 36. 

Resin 36 was further modified with monosubstitued 2-fluoro-5-nitrobenzoic acid 37 using 
HOAt/DIC strategy to afford immobilized substrate 38, which was ready for the assembly of 
the nitro-10//^ dibenz[6^/][l,4]oxazepin-l 1-one analogs 39 (The key cyclization step (SnAt) 
1 5 between the fluor and the phenolic oxygen was performed using a 5% DBU in DMF^^' ^% 
The reduction of the nitro group in the resulting resin can be obtained with the 1.5 M solution 
of SnCl2 H2O in DMF, and subsequent cleavage (TFA/DCM) from the resin 2-amino sub- 
library 39 is obtained. 
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Synthesis of dibenzo-oxazepinone 
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16.6 Pyrazinoqulnazollnone -6,6,6 tricyclic scaffold 




R1 



5 The pyrazino[2,l--b]qiiinazolme-3,6-dione system can be considered as a constrained 

peptidomometic and is present in several families of natural products . Some of these 
compounds exhibit very interesting biological activity (J. Antibiotics 46, 380, 1996, Annu Rev 
Biochem 62 385, 1993). 

One currently known syntheses of this scaffold can be grouped as follows: 
10 a: Transformation of 4-substituted 2,5-piperazinediones into the corresponding iminoethers 
followed by cyclocondensation with anthranilic acid or methyl anthranilate. ^'^ 




iminoether anthranilic acid condensation to Pyrazinoquinazolinone 



15 b: Acylation of 4-substituted 2,5-piperazinedione with o-azidobenzoyl chloride followed by 
Staudinger reaction with phosphine to yield the corresponding y-phosphazene and subsequent 




Pyrazinoquinazolinone via N-o-azidobenzoyl-diketopiperazine 

20 

In a modified reaction sequence the N-o-azidobenzoyl-diketopiperazine is formed via an open 
chain tripeptide where the anthranilic acid unit is the N terminal unit bears an azido group as 
masked amino function^. Cyclization generates the quinazoUnone ring. 
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c: Double cyclization of an open chain tripeptide via 4-imino-4-H-3,l-benzoxazine 
intermediate prepared through cyclodehydration of a suitable o-acylanthranilamide in the 
presence of iodine triphenyl phosphine. 
5 This method was reported in solution ^'^^ as well as on solid phase^"* , which makes it a good 
mean for parralel array synthesis therefore suitable for our purpose. 

EOC 1 -ethyi-3(3-dimethytamino)-propyl)cart>odumide 
R3 




.NHFmoc 



O R4 
1 (up to 5% epimerization) 

Pyrazinoquinazolinone via benzoxazine intermediate. 



10 The tripeptide 6 is prepared by direct coupling of the amino acid esters (AA-OR) 3 

with antraniUc acid mediated by EDC. Condensation of 4 with the Fmoc amino acid chloride 5 
under two phase Scotten-Bauman condition (CH2C12, aq Na2C03) yields the tripeptide 6. 
amino acid chlorides 5 are prepared in situ by pre-activation of the corresponding Fmoc-AA- 
OH with BTC (triphosgene) and coUidine in THF, DCM or Dioxane . These conditions 

1 5 afford AA Clorides without racemization. 

The transformation of the linear tri peptide to oxazine was accomplished vising Wip's 

conditions (PPha /I2 /tertiary amine in large excess) Deprotection followed by rearrangement 

to quinazoline occiirred upon treatment with 20% piperidine in methylene chloride . The 

cyclization to quinazoline is susceptible to steric hindrance and in case of R3,R4 =bulky 
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groups cyclization requires stronger condition (DMAP reflux CH3CN). Some epimerization 
(5%) took place in case in some of the examples. 

The application of the s synthesis in solution described above to combinatorical 
synthesis on solid phase initiates with loading of Wang resin with appropriate amino acid (AA) 
affording 7. For majority of AA the preloaded Wang resin is commercially available. 7 was 
deprotected (piperidine in DMF) and appropriate anthranilic acid along was coupled (EDC) to 
obtain 8( pathway below). 



rOH 



Wang resin 



Triphosgene 
Collidine, DCM 



R3 



1. Piperfdine 
2. antthranilic acid 2. 
EDC 



R3 



NHFmoc 



R4 

HOjC^NHFmoc 

Triphosgene 
Collidine. DCM 



R3 



FmocNH 



R4 



■°r> 



PhaP. I2 
DIEA 



R3 



R1 

6 




NHFmoc R1 



10 



Piperidine 



R3 

il - 



R2 



R1 



R4 



O 



AcCN/DCE 


0 R3 


reflux 24h 














R1 R4 




2 diastereomers 




1 



SPS of Pyrazinoquinazolinone 



The next step is acylation of aniline 7, with Fmoc ~AA-C1 to obtain linear tripeptide 9. 
The next step is the key dehydrative cyclization of linear tripeptide 9 to 10. To ensure 
complete conversion, 10 equivalents of PhaP were used. The final reaction is piperidine 
mediated deprotection of Fmoc group and rearrangement of oxazine 10 to amidine carboamide 
11. After washing, the resin was refluxed in acetonitrile to induce cyclative cleavage of 11 
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obtaining the desired pyrazinoquinozaline library 1. The yields and purity of crude compounds 
were claimed to be relatively high . Final products 1 can be obtained in few cases as mixture 
of CIS : trans diasterioisomers (usually the ratio is 5-8:1). the larger degree of epimerization on 
solid phase is probably due to the cyclizative cleavage, and HT purifier can separate the 
5 products. The above synthesis nicely illustrates the favorable features of the synthetic route. 
The first two steps involve peptide couplings - the reaction for which SPPS was developed 
and which proceeds in ahnost quantitative yield for a variety of amino acids. The dehydration 
of the liner tripeptide 9 requires large excess of PhaP, iodine and TEA - reagents which are 
readily removed by simple filtration on solid phase. The ester functionality undergoing 
10 cyclization in the final step was chosen as the position for solid-phase attachment, resulting in 
self-cleavage firom the resin. 

The synthesis of pyrazinoquinazoline scaffold requires 3 building blocks the 2 amino acids 3,5 
and disubstituted anthranilic acid 2. 



15 The amino acids and the Fmoc —amino acid are conraiercial available. 

In order to introduce hetero functionalities (NH2, OH) to Pyrazine ring (R3, R4) the synthesis 
of protected a-hydroxy- AA 14 and a-amino-AA and 12 should be performed . AA 12 is 
known in literature^^ and the synthesis is illustrated in the pattiway below: 

0 

Toluene, ref. FmocNH 



Fmoo-NHj 



Toluene, ref. 
p-Tos-OH 
HO-tBu 



p-Tos-OH 
HS^Bu 



13 



StBu 



OH 



NBS 
Boc-NH, 



FmocNH 



14 



OtBu 



FmocNH 



■A. 



OH 
NHBoc 



12 



20 



synthesis of protected a-amino-a-OH ainino acids 



Another AA 14 can be prepared by the similar mode through the condensation between 
glyoxylic acid and FmocNH2 in presence of r-BuOH in boiled toluene affording the desired 14. 

25 Out of the third building block 3,5 dimethyl anthranilic acid is commercial the other 
substituted anthranilic acid should be prepared in a tailor-made synthesis . 
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3-methyl-S-phenyl-anthranilic acid 15 can be prepared by bromination of the commercial 
available 3-methyl-anthranilic acid 16^^. Followed by Suzuki reaction^ ^ . 

Br-, MeOH. AcOH 



— cKiLj tLM^ r\ reaction on the ester 

NaB03.AcOH Pd(PPh j,. l^C03 

S^NH, y^NH, IJ^^ 

Ma Ma I. 



COjH 

Atdrich 15 
Preparation of 3-methyl-Salkyl or phenyl antfaranilic acid 



Me Me 

16 

Aldrich 



3,5- diphenyl-anthranilic 17 acid will be prepared from the corresponding 
dibromoanthranilic acid 18 (commercial) via Pd catalyzed cross coupling reaction with excess 
10 of phenyl boronic acid^^ (Aldrich). 

PhB(0H)2( 3eq) 
Pd(PPh3^4lO%mol) 
EtOH. aqNasCOa 

Ph 

Aldrich Aldrich 17 

18 

Preparation of diphenyanthranilic acid 



Substituted anthranilic acid can also be prepared from the corresponding substituted aniline 19 
1 5 using a modified Sandmayer methodology. Reaction of the aniline with chloral and 

hydroxylamine affords the isonitrosoacetanilide followed by cyclization in sulphuric acid 
yields isatinZO. Oxidation of the later with H2O2 affords anthranilic acid 21. (see pathway 
below) 
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a3CCH(OH)2 
NH2OH. Na2S04 ^ 
H20 



J. Med Chem 
^4H2 33. 814. 1990 



R1 



NaOKHjO R2 




21 

Preparation of anthranilic acids via isatin 

Anthralinic acids substituted in position 3 with an OH group 22 can be prepared following 
the reaction sequence described in the pathway below using 4-substituted anilines (Et, Pr, Me 
Aldrich) as starting materials. The aniline was first brominated (23) followed by selectively 
monomethoxylation in the presence of Cul. The 2-bromo-6-methoxy-4-alkylaniline 24 thus 
obtained was carbonylated using Pd complex as catalyst (CO, Pd(PPh3)2Cl2) (=>25)andthe 
final step is deprotection by hydrolysis in concentrated hydrobromic acid^^ 

NaOMe. Cul 

Br2.dloxane complex, dioxane „^ „ Pil!?'' 

^ H,O,>10«C,1hr Rl-spY^^ 1000c. Ihr ^ R1>y^QM^ 



19a 

R1 = Me, Et, Pr (Aldrich) 



23 

1. ACjO 
2. CO (lOatm). PPhj 

PI ^ nw BU3N. (PPh3)2PdCU 

"^"^^^^ Rlx^^-wOMe Hp. 1100c, 18hr 



Br 
24 



CO*" io,H 
22 25 
Preparation of 3-hydroxy-5-alkyl anthranilic acid 



4-alkylaniline 19a can also serve as a starting material for the preparation of dialkyl anthranilic 
acid 26 and S-allcyl 3-phenyl anthranilic acid 27 as described in the pathway below 
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3-alkyl-5-caiboxylanthranilic acid 27 can be prqjared starting from o-alkylaniline 19b that is 
converted to isatin 20a (1. chloral, NH20H, 2. H2S04), followed by bromination and 
oxidation to obtain the 5-bromo anthranilate 28 Substitution of the bromo with cyanide (29) 
and hydrolysis affords the 3-alkyl-S-carboxyl-anthranilic acid" 27. 
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16.7 Pyrrole -5 membered ring scaffold 

Jn this chapter is described the comprehensive synthesis of tetra-substituted pyrroles. 

The proposed ^thetic methods are on Solid Phase (SPS) as well as in solution. 

Ovcrvtew of pyrroles library and sub-libraries 



R3 R2 R3 R2 



R3 R2 d 



R3 R2 



/ ' \ 



HO. 
R4' 



9 f 

Sub-library a which has a carboxyl group at position 2 is prepared in solution. The synthesis 
starts from nitrosation of p-keto esters to obtain oximes 3, which by reductive condensation 
with l,3-4iketones lead to ethyl carboxyketopyrrols 5^ (pathway below). Pyrroles 5 imdergo 
reduction of the carbonyl group to methylene ^, following by hydrolysis of ethyl carboxylate to 
afford the sub-library a. Curtius rearrangement may convert the carboxyl into amine resulting 
in the conversion of sub-library a to sub library b most conveniently, (in case Rl # R2 
mixtures of two isomers are obtained and may be separated ). 



NaNOj, AcOH Q 
COjEt " Rl 



O O 



.JU. 



COjEt 



Zn. AcOH 



R4(Rs)'^j^''^COjEt 



? 

H 



1.NaBH4, 



2. LiOH, 



Sub-library a 



1. DPPAorTMSN3 
2.AcGH,ref. 



Sub-library b 
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Synthesis of sub-libraries a, b. 



10 



Two building blocks are required for the synthesis of sub library a, and b, p-ketoesters, 1,3 
diketones which are mostly commercially available. 

Compoimds of sub-library c can be obtained by the synthetic method described in the pathway 
below. In contrast to the former method this ^proach involves solid phase synthesis (SPS). 
Namely: condensation of i,2-diketones 7 with pre attached Boc imino diacetic acid mono ester 
6 as follows: 



o 

Boc y—o 



LNaOMeorKaBu 
0.5h r^ux 



R4 R3 


1. 2x--^V° 
o 


R4 R3 


2.2XTMSN3 




3.heat,AcOH 


H 


c 




d 



\-J 1. o 



OH 

O NaCNBHg 



o I 



Synthesis of additional ten compounds of category a by SPS. 



The reaction^' ^ is performed xmder basic conditions using NaOMe or KOtBu. Imino diacetic 
acid 6 can be easily prepared from t-Bu ester of Gly by reductive amination of Glyoxylic acid 
using Sodium cyanoborohydride as a reduction reagent and subsequent introduction of a Boc 
1 5 protecting group in multi-gram scale^. 

Sub-library e can be prepared using the method described in flie pathway below. Position 3 in 
tiie resulting products has a fixed subtituent- an hydroxy group. Again, SPS is involved using 
pre-prepared building blocks as described above. 

The process initiates from preparing five acyl Meldrum's acid building blocks (12) in 
20 solution by reaction of acid chlorides 10 with Meldrum*s acid 11 to give, in the presence of 
pyridine the corresponding compound 12 ahnost quantitatively^' ^' 

Thus, heating 12 (5 equiv.) with the hydroxyl resin (the resin which generates carboxylic acid, 

for example the oxime resin^ in THF at reflux for a few hours^ affords the polymer-boimd p- 

ketoesters 13 with concomitant release of CO2 and acetone, which helps to drive the reaction 
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to completion. The reaction could be easily monitored by FT-IR on the resin (KBr pellets). The 
fuctionalization of the a-caibon of 13 is perfomied with excess of the alkylating reagent, 
avoiding O-alkylation as well as double alkylation. 

Thus, haloalkanes (36 equiv.) in the presence of 1 M TBAF* in THF (26 equiv., 3h) easily 
convert 13 to 14 at RT (Fig.4). Typically it is important to exclude traces of water, which may 
decrease the yield. Addition of an excess of presynthesized anuno ketones 15*®'*^ (Fig 5) (20 
equiv., 3h, RT), to the resin linked p-ketoestersl4 in THF/trimethylorthoformate (1/1) gives 
the Shiffbases 16, Cyclization of 16 under basic conditions with concomitant release of the 
product 17 into the solution followed by reduction of the ketone (S3 =Me,Et ) . (NaBH4 
BF30Et2)^ produces sub library e. 



R1-COCi 
10 

THF. Reflux 




Pyridine 
CH2CI2. 0-250C 



R1 A— O 
O 

12 



R1 



13 



R2-X, TBAF 
THF, 25-70 oC 



R2 



CI NH, 



'^-^^R3 
15 



O N 



t4 



THF/fMOF. RT 



R2 



R1 
16 



LDAorLHMDS 



ZnCL 



R2 OH 



O 
17 



NaBH4, BFaOEtj 




The reaction can also be performed in solution using a-substituted p-ketoesters . following the 
same reaction sequence. 

It should be noted that p-hydroxy pyrroles may exist to some extent in its keto tautomo-^^ The 
required building blocks are p-keto esters which are commercial or the a-substituted — p- 
ketoesters. 



145 



wo 2004/010136 W CT/BL2002/000614 
The a-aminoketone building block can be prepared from the corresponding amino acid 
hydroxamate as described in the following pathway. 



I O OMe R1.MgX.THF i O 

1. JL ^ N -20**C-rt J. X ^ R3 HCIInDIoxane oiHJSl^"^'^ 

iTy^^ " ° «^ ^ T 

O O 
18 19 20 

Synthesis of amino ketones from Gly Boc hydroxamates. 



N-protected glycine recats with N-O-dimethyl hydroxyl amine to give hydroxamte 18 
Reaction of the glycine hydroxamate with Gringard reagent (EtMgBr, MeMgBr) affords the 
ketone 19 no over adding is observed. Deprotection of 19 gives the amino ketone building 
10 blocks. 

In case R3= OH, glycinate reacts with the substituted p-keto esters 

Sixteen more products can be obtained by the method described in the pathway below. A key 
step for the preparation of sub library f is Michael addition of amino ketones 21 to DTAD 
(21)^^ . The obtained aminoolefine 23 undergoes cyclization in acidic conditions, to afford the 
15 sub-library £ 

R1 CO^t-Bu ^COAJBu ^ R\ y^02H 



R2 



C02t-Bu H ^ 



21 22 23 f 

LDATHF O H,N^CO,Et ^CO,Et 1-NaOEt ^ 

O R2Br I ru toluene refliix hn R2 2,LIOH.H,onKF V—/ 

R2 R1 CN H ' 

25 26 27 26 



N8NH2.NH3(Dq)4 



r» N8NH2.NH3(Dq)f orLDATHF 

orR1CN.EtONa 



R1 OEt R2 

synthesis of 2-carboxy-3-amino-pyrToles 

20 The synthesis of 2-carboxy, 3-amino pyrroles 28 is well known^^* " (see pathway above). It is 
performed through the enamine formation of 26 and subsequent intramolecular cyclization of 
27 under basic conditions (NaOEt) to give 28. The P-keto nitriles 25. can be prepared by 
either alkylation of 25^^ or acylation of the corresponding nitrile.^^ 
References 
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16.8 Thiophenes and related scaffolds 

The chemistry of 2-aminofhiophenes and related scaffolds has attracted special 
5 attention in the last 30 years because of their ^plications in pharmaceuticals, agriculture, 
pesticides and dyes. 




R 



The chemistry of 2*aminothiophenes are conveniently available through the, synthetic method 

developed by Gewald who devised the most facile and promising synthetic route leading to 

10 2-aminothiophenes A with electron withdrawing substituents such as cyano, carbethoxy etc, in 

the 3- positions and alkyl, aryl, cycloalkyl, and hetaiyl groups in the 4- and S- positions. 
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R1 



S 



R1 pOjEi 





COjEt 



CN 



Amine 



R2 



A 



Gewald reaction 



The simplest version of the Gewald reaction consists of a one-pot procedure, namely 



5 condensation of aldehydes, ketones or 1,3-dicarbonyl compoimds with activated nitriles and 
sulfur in the presence of amine at room temperature. Ethanol, DMF, dioxane are preferred 
solvents and amines like diethylamine, morpholine, or triethylamine have been used ^'^ This 
method offers considerable improvement over other methods by replacing an a- 
mercq)toaldehyde or an a- mercaptoketone by simpler starting materials. It is necessary to use 

10 0.5-1 molar equivalents of aniine based on the amount of nitrile to obtain high yield. 

In another synthesis version a two-step procedure is preferred. An a,b-xmsaturated nitrile is 
jfirst prepared by a Knoevenagle-Cope condensation and then treated with sulfur and an aniine. 
This two-step version of the Gewald reaction gives high^ yields. Alkyl aryl ketones do not 
give thiophenes in the one-pot modification, but gives acceptable yields in the two-step 

1 5 technique^ (see pathway below). 



The use of t-butyl cyanoacetate instead of the ethyl ester enables to obtain free acid of 3- 
20 carboxy-2-aminothiophenes by convenient TFA/DCM hydrolysis* 

The amino acid obtained as well as the protected acid can be used as building blocks for 
further transformation to more complex scaffolds as is exemplified below: 




A 



Two step Gewald reaction 
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R2-'^^^NH, K2CO, acetone 
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30%AcCVAeOH '^''v /^^^ 

Ac O 2 



OEt 



RZ" 



EtONa/EtOH 



reflux 



3 



^a^* KjCO,. MeOH/HjO 

orUOH.7HF/HaO 



COjH 



B1 



''''v/CN 30%AcCI/AcOH .CN 



1. 

0 



A1 



.^^ I^H EtONa/EtOH. reflux 
Ac 2.UOH.THF/H2O 




1, ^^""^ ' 2. K2C03 acetone 



3.IJOH,THF:H20 



Thienopyrrole synthesis 



5 Thienopyrrole scaffold B (pathway above) is prepared by the reaction of aminocarboxylate A 
with bromoacetate (K2CO3) to obtain diester intermediate 1, which after acetylation 
(compound 2) (30% AcCl in AcOH) imdergoes Dieckmann condensation (EtONa, EtOH) to 
afford 3-hydroxy-2-cjEu:boxy thieno[2,3-b]pyrrole Bl. The amino analog B2 requires starting 
with the 2-amino-3-cyano thiophene Al. Acetylation followed by alkylation with a- 
10 bromoacetate (K2C03 acetone or NaH DMF) leads under similar reaction conditions to ring 
closure producing 3-amino-caboxy thienopyrrole B2. Acetylation of the amine at position 2 
and LiOH .are required to increase the nucleophiUcity of the amine . 

16.8.2 5»6-bicyclic scaffolds 

The thienopyridine scaffold C is prepared via modified Friedlander reaction , namely reaction 
15 of thiophene A, Al and 5 with b-ketoesters , 1,3 diketones under basic condition to form 
thienopyridines as described in the pathway below 
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C6 5 

C3 

Thienopyridine synthesis 



Another 5,6-bicyclic ring system - the thieno pyrimidine D is prepared by the reaction of 
thiophene A ,Alwitii chloro fonnamidine hydrochloride 4, 1 1 




HO 



D 

Thienopyrimidine synthesis 

16.8.3 5,8,5 5.8,6 tricyclic and 5,5,8,6 5.5.8.5 tetracyclic scaffolds 

The scaffolds E,F G and H can be generated from thiophenes are described in scheme 6. These 

compoimds resuh from the formation of an eight membered ring Dilactam. 
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H 



preparation of 8 membered ring dilactam 

The formation of the eight membered ring includes several steps: 

1. Activation of the p-antiino acid using SOC12'^* or POC13^^^(in these cases tiie amine 
should be protected by Boc ) or by DCC*^*^ and methyl chloroformate^^** 

2. Coupling of the activated acid and another N-protected P-amino-t-butyl ester^^' 

3. Deprotection of the t-butyl ester and the N-Boc amine using TFA in DCM 

4. Coupling by PyBop or any other analog in case R' is a benzyl group it can be removed 
at this stage by hydrogenation, 

16.8.4 5,7 bicyclic scaffold 

The synthesis of I, J analogs of the benzodiazepines scaffold is illustrated in the following 
pathway. In both approaches chiral anmo acid are introduced into the synthesis raising the 
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diversity around the a carbon. Thieno diazepine I is prepared from 2-amino-3-acyl)- 
thiophenes 5. which reacts with pre formed Boc amino acid chloride (amino acid, BTC, 
collidine, THF or DCM). Deprotection of 8 (4NHC1) with concomitant ring closiire leads to 
2-oxothienodiazepine L Thiophenodiazepine J can be prepared starting from 2-amino-3- 
carboxy-thiophenes A, which after pre activation to the thienooxzaine dione (BTC, collidine, 
THF or DCM)) reacts with amino ketone to obtain 9, ring closure afford S-oxothienodiazepine. 



J 



14 



R2 




8 



.NHBoc 



4N HCI/dioxane 




I 




A 



R4 



J 



K2C03 



9 



synthesis of thienodiazepines 
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1. benzylaxycarbonylamtnoaoetylchioride. CHCI3 
2. 20% HBr AcOH 

2.N2H4 

lLJLJN-9Hcoa 




acHRcoa 



NHCOCHRa 





1. Nal, 
Z NH3 



10 



11 



pyrldineAcOH 
benzene reflux 



The synthesis of thienodiaepine K 

The synthesis of thienodiazepine K is described in the above pathway. 2-anuno-3-acyl 
tfaiophene 5 is first acetylated with the appropriate a-haloacetyl chloride Nucleophilic 
substitution with Nal followed by anunonia to obtain the amino amide^^ 1 1 . The latter 
undergoes ring closure to the thienodiazepine K under acidic conditions Another alternative . 
is to react thiophene S with the phthalide protected amino acyl chloride, Deprotection witii 
hydrazine (1 1) and ring closure to obtain thienodiazq)ine K 



The synthesis of thienozepine L is based on coupling of sucssinic anhydride or acid chloride 
monoester with thiophene 5 (see pathway below) The obtained amide 12 imdergoes 
intramolecular condensation (NaH) to provide the targeted compounds^^. 



ri^Vr3 lNaH 



^-^^-J^""^' 2.LIOH 
12 




preparation of thienozepine 



Scaffold M, having a thienodiazepinone skeleton may be prepared as described in the 
pathway below. The N-protected aminocarboxythiophene A is first preactivated (BTC, 
collidine, DCM) and submitted to reaction with a-amino acetonitriles 14 to afford amide 13. 
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The latter reacts under basic conditions (NaOMe) to provide through the intramoleciilar 
cyclization the disubstituted intennediate 2-aminothieno-l,4-diazepin-5-one 15^^ . In the next 
step 2-aminothieno-l,4-diazepin-5-one 15 is heated with acetyl hydrazine leading to 
thienotriazolodiazepinone M 

o^ 

R1 



NH 
boc 



1.BTC, Collldine, PCM 
R3 

1. )-CN 

14 



R2 



f^l ^-fj 1.4NHCI/dioxano R1 y^^\^RS 

J^J^^^ 2.NaOMe.MeOH ^J^X-n^m 
boc 

13 





15 




R4CONHNH2 




HMPA, heat 










M 


R4 



The synthesis of thienotriazolodiazepinone 



16.8.5 5,6,5,6 Tetracyclic and 5,6,5 tricyclic scaffolds 

The thiophene substituted in the 3 position with a benzimidazole namely 

benzimidazoloaminothiophene 16 can serve as a building block for the synthesis of 

thieno(2%3*,4,5)pyrimidino(l,6)benzimidazole,N Nl, The starting material 2- 

cyanomethylbenzimidazole 16,is prepared from substituted phenylene diamine 17 and 

malononitrile ^^Nitrile 18 is submitted to Gewald reaction using elemental sulfur powder and 

ketones or cyanoacetamide in dry DMF containing a catalytic amount of TEA under reflux 

to form thiophene 16 (see pathway below). 
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17 



1. NC^CN 



R3 



R3. R4 



R1 



R2 



Se. amine ^ 




R4 

1/ OHC, 



R5 



N^NH 



18 



So-" 

J. 



DMF. pip.,heat 



16 
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The synthesis of thienopyrumdinobenzimidazole 



Condensation of 16 with aldehydes or ketones, afford N and Nl respectively 
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R2 



N^NH 

20 



Sq, amine 



OHC^ 



R5 



R2 



DMF. p(p.,heat 



19 



R2 



R4 



oi 



R5 



R2 



R4 

Ri y-N 



The synthesis of thienopyrmudinodihydioimidazole 



Using the same approach dihydroimidazoylacetonitrile 20 ^^(see pathway above) and 
thienoimidazoyl-acetonitrile 2 1 (see pathway below) can be prepared from the corresponding 
diamines (ethylene diamine and thiophene2,3 diamine^^) and malonohtrile The resulting 
nitriles react with ketones under Gewald conditions forming O, Ol and P, PI . 

HjN NH, 



NC^CN 



* N^NH ^ 
21 



R2 



R4 R3 

VV^H DMF, pip.,h8at 

R1-^S NHj 



22 



RS'^RS 



R1 



R2 



%^R3 

HiNry,s 

^^N^R5 



RI 



R4 



RZ-^s'^tJ R5 



PI 
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The synthesis of scaffolds P and PI 



16.8.6 5-6-5-6 tetracyclic scaffold 

R3 1BTC 



2. POCI, 



R1 



pOJEX 



R4 

23 



1800C, 2h 




X^SorN 



synthesis of scaffold Q 



4H-tbieno[2^3^4,5]pyrimido[2J-b]benzothia-o^--zolesQcan be prepared from amino 
thiophene A as outlined in the pathway above.^^ 2-Amino-3-carboxythiophene undergoes 
condensation at high temperature with chlorobenzimidazole^^. chlorobenzthiazole 23 leading 
to the corresponding tfaienopyrimidinazoles Q. 

16.8.7 5-6-5 tricyclic scaffold 

Thia-triaza-s-indacenone R (see pathway below), can be obtained according to literature 
procedures Li this synthesis the aminothiophene A imdergoes cyclization in boiling acetic 
acid with pre formed methylthio imidazoles 24 to give the desired system. R 



R2 co,et 



24 



R4 

















R 
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It will be appreciated that the above described methods of target measurement and drug 
discovery may be varied in many ways, including, changing the order of steps, which steps are 
performed on-line and which steps are performed off-line. In addition, various parallel and/or 
sequential configurations may be used to implement the above invention, optionally utilizing a 
variety of software tools and/or various hardware/software combinations. In addition, a 
multiplicity of various features, both of methods and of devices has been described. It should 
be appreciated that difiTerent features may be combined in different ways. In particular, not all 
the features shown above in a particular embodiment are necessary in every similar exemplary 
embodiment of the invention. Further, combinations of the above features are also considered 
to be within the scope of some exemplary embodiments of the invention. Also within the 
scope of the invention are computer readable media on which software, for performing part or 
all of an exemplary embodiment of the invention, are written. It should also be appreciated that 
many of the embodiments are described only as methods or only as apparatus. The scope of the 
invention also covers hardware and/or software adapted and/or designed and/or programmed to 
carry out the method type embodiments. In addition, the scope of the invention includes 
methods of using, constructing, calibrating and/or maintaining the apparatus described herein. 
Headers, where they appear, are provided for ease of browsing and should not be construed as 
necessarily limiting the contents of the section to that which is suggested by the heading. When 
used in the following claims, the terms "comprises", "comprising", "includes", "including", 
"having" or their conjugates mean "including but not limited to". 

It will be appreciated by a person skilled in the art that the present invention is not 
limited by what has thus far been described. Rather, the scope of the present invention is 
limited only by the following claims. 
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CLAIMS 



1. A method of obtaining information about a chemically active area of a target molecule, 
comprising: 

5 providing a set of substantially rigid chemical gauges; 

reacting said target with a plurality of gauges of said set of gauges; 

assaying a binding of said gauges with said target to obtain a plurality of assay results; 

and 

analyzing said assay results to obtain information about said chemically active area. 

10 

2. A method according to claim 1, wherein said gauges allow rotation of moieties of said 
gauges. 

3. A method according to claim 1, wherein said gauges are constmcted using a rigid 
15 scaffold. 

4. A method according to claim 1, wherein constituent atoms of said gauges do not move 
more than 1 A unless at least 20Kcal/Mol are £q>plied to the gauge. 

20 5. A method according to claim 1, wherein analyzing comprises identifying a plurality of 
spatial and chemically specific bindings configurations in said target active area, 

6. A method according to claim 5, wherein said configurations comprise triangular 
configurations. 

25 

7. A method according to claim 5, wherein identifying comprises identifying a 
configuration that matches a configuration of a bound gauge. 

8. A method according to claim 5, wherein identifying comprises identifying a 
30 configuration that does not match a configuration of a bound gauge. 



9, A method according to claim 8, wherein identifying comprises identifying by statistical 
analysis of said assay results. 

161 



319/02004 

WO 2004/010136 





T/ILOZ/ OU 0 I t 



PCT/IL2002/000614 



10. A method according to claim 9, wherein identifying comprises identifying by 
clustering. 

5 11. A method according to claim 5, wherein identifying comprises assuming each gauge 
indicates a single configuration. 

12. A method according to claim 5, wherein identifying comprises assuming at least some 
of the gauges indicate a plurality of configurations. 

10 

13. A method according to claim 5, wherein identifying comprises classifying gauges by 
chemical moieties at vertexes of said configurations. 

14. A method according to claim 1, comprising reconstructing a spatial map of at least part 
15 of said chemically active area, firom at least two of said assay results, said part including at 

least four chemical binding areas. 

15. A method according to claim 14, wherein said part includes at least six chemical 
binding areas. 

20 

16. A method according to claim 5, comprising reconstructing a spatial map of at least part 
of said chemically active area, firom at least two of configurations, said part including at least 
four chemical binding points. 

25 17. A method according to claim 16, wherein said part includes at least six chemical 
binding areas. 

18. A method according to claim 16, wherein reconstructing comprises: 



30 



test-reconstructing a plurality of spatial maps from said configurations; 

scoring said maps; and 

selected a spatial map based on its score. 



19. 



A method according to claim 16, wherein reconstructing comprises: 
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test-reconstructing a plurality of spatial maps from said configurations; 
clustering said maps according to common substructures; and 
selected a spatial map based on a relative property of a cluster it belongs to. 

5 20. A method according to claim 19, wherein said relative property comprises size. 



21. A metiiod according to claim 16, wherein said spatial map includes enough binding 
points to ensure binding of a small molecule drug having a chemical profile matching the 
binding points. 

22. A method according to claim 21, wherein said spatial map includes at least 6 binding 
points. 



23. A method according to claim 21, wherein said spatial map includes at least 8 binding 
15 points. 

24. A method according to claim 1, wherein said set of gauges comprises a set of gauges 
with at least 10,000 gauges. 

20 25. A method according to claim 1, wherein said set of gauges comprises a set of gauges 
with at least 50,000 gauges. 

26. A method according to claim 1, wherein said gauges comprise moieties arranged in 
spatial configurations and wherein said gauges are selected to span a virtual space of spatial 
25 chemical configurations. 



27. A method according to claim 1, wherein substantially each point of virtual space that is 
spanned by said gauges is covered by at least two gauges. 

30 28. A method according to claim 1, wherein substantially each point of virtual space that is 
spanned by said gauges is covered by at least three gauges. 
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29. A method according to claim 1, wherein at least 0.5% of said gauges bind with said 
target. 

30. A method according to claim 1, wherein at least 1% of said gauges bind with said 



31. A method according to claim 1, wherein at least 3% of said gauges bind with said 
target. 

10 32. A method according to claim 1, wherein at least 50% of said gauges are defined by 
adding moieties to a set of fewer than 100 scaffolds. 

33. A method according to claim 1, wherein at least 50% of said gauges are defined by 
adding moieties to a set of fewer than 50 scaffolds. 

15 

34. A method according to claim 1, wherein at least said set of gauges uses fewer than 15 
different chemical moieties to define the chemical behavior of said gauges. 

35. A method according to claim 1, wherein at least said set of gauges uses fewer than 10 
20 different chemical moieties to define the chemical behavior of said gauges. 

36. A method according to claim 1, wherein said assay is a fimctional assay. 

37. A method according to claim 1, wherein said assay is a binding assay. 

25 

38. A method according to claim 1, wherein said assay is a cellular assay. 

39. A method according to claim 1, wherein said assay is a flow-tiirough assay. 

30 40. A method according to claim 36, wherein said fimctional assay is performed in the 
presence of a natural substrate of said target. 



5 target. 



164 



319/02004 

WO 2004/010136 





PCT/IL2002/000614 



T/IL U < / U U O I -I 



41. A method according to claim 1, wherein said target comprises a protein including a 
biochemically active area adapted to engage a substrate. 

42. A method according to claim 41, wherein said chemically active area comprises an area 
5 including said biochemically active area. 

43. A method according to claim 41, wherein said chemically active area comprises a 
control area of said protein. 

10 44. A method according to claim 1, analyzing comprises analyzing successful binding of at 
least 60 gauges. 

45. A method according to claim 1, analyzing comprises analyzing successful binding of at 
least 10 gauges. 

15 

46. A method according to claim 1, analyzing comprises analyzing successful binding of at 
least 100 gauges. 

47. A method according to claim 5, wherein identifying comprises identifying at least 40 
20 difTerent configurations. 

48. A method according to claim 5, wherein identifying comprises identifying at least 10 
different configurations. 

25 49. A method according to claim wherein identifying comprises identifying at least 100 
different configurations. 

SO. A method according to claim 16, comprising: 



30 



comparing said map to a lead data base; and 

selecting a lead firom said data base for further use responsive to a semblance or lack of 
semblance between said lead and said map. 



51. 



A method according to claim 16, comprising: 
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comparing said map to a lead data base; and 

rejecting a lead from said data base for further use responsive to a semblance between 
said lead and said map. 

52. A method according to claim 16» comprising: 
constructing a lead to have a semblance to said map. 

53. A method according to claim 52, wherein constmcting comprises constructing using 
said gauges or scaffolds used to define said gauges. 

54. A method according to claim 5, comprising: 
comparing said configurations to a lead data base; and 

selecting a lead from said data base for Anther use responsive to a matching of said 
configurations to said lead. 

55. A method according to claim 5, comprising: 
constructing a lead based on said configurations. 

56. A method according to claim 5, comprising: 

selecting at least one of said gauges as a lead for drug discovery. 

57. A method according to claim 1, comprising comparing the binding of gauges with 
similar binding geometries to obtain steric clashing data; and 

analyzing said steric clashing data to provide geometrical information about said target. 

58. A method of identifying the existence of a plurality of chemical-spatial configurations 
in a target, comprising: 

assaying the target with a plurality of gauges having know chemical-spatial 
configurations at vertexes thereof, to provide a plurality of assay results; 

defining an array of spaces, one space for each set of chemical behaviors of the 
vertexes of each configuration; 

indicating said results according to said spaces, to generate clusters; and 

identifying the existence of a configuration in said target from said clusters. 
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59. A method according to claim 58, wherein indicating comprises spreading an indication 
responsive to a spreading function. 

60. A method according to claim 59, wherein said spreading function is dependent on an 
estimated energy of binding of a gauge to said target. 

61. A method of reconstructing a spatial shape of a chemical binding configuration of a 
target firom a set of sub-shapes, each of which indicates a part of said binding configuration, 
comprising: 

selecting a base fix>m said sub-shqi^; 

selecting at least two sub-shapes having the property that they match each other at least 
along one side thereof and match said base along another side thereof; 
accumulating said sub-shapes to said base; and 

repeating said selecting and said accumulating imtil all of said sub-shapes are used or 
cannot be used, thereby providing a shape of a binding configuration of said target. 

62. A method according to claim 61, con^rising variationally repeating said selecting, 
accumulating and repeating using a different order of selection of sub-shapes. 

63. A method according to claim 62, comprising repeating said selecting a base and said 
variationally rq)eating for a plurality of different base selections. 

64. A method according to claim 63, comprising clustering a pluraUty of such shs^jes 
according to shared sub-component sh^es. 

65. A method according to claim 64, comprising selectmg a sub-component shape as a 
resulting sh£q)e based on said clustering. 

66. A method according to claim 61, wherein said sub-sh^es comprise triangles. 
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67. A method according to claim 61, wherein said sub-shapes define chemical behavior at 
their vertexes and wherein two sides are said to match if the chemical behavior at their 
vertexes match. 

68. A method according to claim 61, wherein two sides are said to match if their length is 
similar. 

69. A method of selecting a scaffold for use in generating a part of a screening library, 
comprising: 

providing a potential scaffold molecule including a plurality of possible attachment 
points for moieties; 

deteraiining a rigidity of the molecule; and 

rejecting said potential scaffold molecule responsive to a lack of rigidity of said 



70. A method according to claim 69, wherein said lack of rigidity is absolute. 

71. A method according to claim 69, wherein said lack of rigidity is relative to other 
potential scaffolds. 

72. A method according to claim 69, comprising selecting a scaffold based on a number of 
rings thereof. 

73. A method according to claim 69, comprising: 

determining a plurality of gauge molecules that can be generated by adding moieties to 
said potential scaffold molecule; 

detemiining for an existing library portion what spatial chemical configurations are 
added by said molecules; and 

selecting said potential scaffold molecule if one or more significant spatial chemical 
configurations can be added by it to said library portion. 



scaffold. 



74. A method according to claim 73, comprising selecting a scaffold based on a number 
of configurations added by said scaffold. 
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75. A method according to claim 73, wherein said significant spatial configurations are 
configurations not previously provided or overlapped with. 



76. A method of selecting a gauge molecule to be added to a screening library, comprising: 
providing a set of chemical molecules and at least a part of a screening Ubrary, 
selecting a potential gauge molecule fi-om said set of chemical molecules; 
detemiining a rigidity of said potential gauge molecule; and 

rejecting said potential gauge molecule responsive to a lack of rigidity of said gauge 
molecule. 

77. A method according to claim 76, wherein said lack of rigidity is absolute. 

78. A method according to claim 76, wherein said lack of rigidity is relative to other 
potential scaffolds. 



79. A method according to claim 76, comprising: 

determining a spanning, in chemical configuration space, of said part of a screening 

library, 

determining at least one spatial chemical configuration of sdd potential molecule; and 
selecting said potential gauge molecule if it adds at least one significant spatial 
chemical configuration to said screening library. 

80. A method according to claim 76, wherein providing a set of molecules comprises 
generating said molecules using a single scaffold to which moieties are selectively attached. 

81. A method according to claim 76, wherein providing a set of molecules comprises 
providing a chemical library. 

82. A method according to claim 79, wherein said gauge is selected if it adds at least one 
spatial chemical configuration not previously provided or overlapping a provided 
configuration. 
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83. A method of creating at least a portion of a screening library, comprising: 
selecting a scaffold molec\ile to which moieties can be added; 

determining a plurality of potential gauges which can be created by attaching moieties 
to said scaffold; and 

5 selecting a subset of said gauges that do not substantially overlap in chemical 

configurations. 

84. A method according to claim 83, comprising: 

rejecting potential gauges that add over six spatial chemical configurations. 

10 

85. A method of reducing a screening library, comprising: 

for each molecule in at least part of said Ubrary, determining substantially all the spatial 
chemical configurations of a certain order of binding points provided by the molecule; and 

removing a plurality of molecules which add redundant spatial chemical 
15 configurations. 

86. A method according to claim 85, wherein said certain order is three. 

87. A method of reducing a screening library, comprising: 

20 for each molecule in at least part of said library, calculating a bmding probability of 

said molecules based on energetic considerations; and 

removing at least some molecules whose binding probability is below a threshold 

value. 

25 88. A method according to claim 87, wh^ein said binding probability is calculated using a 
formula which is invei:sely dependent on a flexibility of the molecule. 

89. A method according to claim 87, wherein said binding probability is at least estimated 
based on a solubility of the molecule. 

30 

90. A method of designing a screening hbrary for a projected target molecule task, 
comprising: 
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detennining a desired range of distances between binding points to be directly 
identified by said library; 

determining a desired overlap between measures provided by gauge molecules of said 

librar)^ 

determining a set of desired binding types to be discriminated between; and 
generating a plurality of gauges, said gauges each defining a plurality of binding types 
and distances between them, such that said gauges cover a spatial chemical configuration space 
that includes said distances and said binding types with said desired overlap. 

91: A method according to claim 90, wherein generating a plurality of moieties comprises 
generating by attaching moieties to scaffolds. 

92. A method according to claim 90, wherein said gauges cover a spatial chemical 
configuration space of triplets of binding points. 

93. A method according to claim 90, wherein said projected target molecule task comprises 
proteins. 

94. A method according to claim 90, wherein said overlap is at least two. 

95. A method according to claim 90, wherein said overlap is at least four. 

96. A method according to claim 90, wherein said overlap is at least six. 

97. A method according to claim 90, wherein said gauges are substantially rigid. 

98. A method according to claim 90, wherein said cov^age takes into account an inherent 
flexibility of binding. 

99. A method according to claim 90, wherein generating comprises generating 
substantially same configurations by different gauges, thereby providing at least part of said 
overlap. 
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100. A method according to claim 99, wherein generating comprises providing a repetition 
factor of at least two. 

101. A method according to claim 90, wherein generating comprises generating 
5 substantially different configurations by different gauges, which different configurations 

overlap due to a degree of flexibility thereof, thereby providing at least part of said overlap. 

102. A method according to claim 1, comprising generating a set of dmg leads for said 
target based on said information. 

10 

103. A method according to claim 102, comprising removing known drug leads for said 
target fi'om said set. 

1 04. A lead set produced by the method of claim 1 02. 

15 

105. A lead set produced by the method of claim 103. 

106. A drug lead comprising: 



107. A screening library comprising: 

at least 10,000 molecules generated by attaching moieties to a set of fewer than 50 
25 scaffold molecules. 

108. A screening library according to claim 107, wherein fewer than 20 scaffold molecules 
are used to generate said at least 10,000 molecules. 

30 109. A library according to claim 107, wherein said scaffolds include at least one of the 
following scaffold molecules: Thiophene; IH-Pyrrole; Furan; Benzene; Pyridine; Pyrimidine; 
Pyrazine; 6H-Thieno[2,3-b]pyTrole; 1 ,6-Dihydro-pyrrolo[2,3-b]pyrrole; IH-Indole; 
Thieno[2,3-d]pyrimidine; 6,7-Dihydro-pyrazolo[ 1 ,5-a]pyrimidine; Quinoline; Isoquinoline; 



20 



a plurality of substantially rigid scaffolds molecule sections; 

at least one link interconnecting said scaffold molecule sections; and 

a plurality of moieties attached to said scaffolds. 
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Quinoxaline; 3,4-Dihydro-benzo[e][l,4]diazepin-5-one; 3,8-Dihydro-4H-pyrTolo[2,3- 
e][l,4]diazepm-5-one; 3,4-Dihydro-thieiio[2,3-e][l,4]diazepin-5-one; 3,6-Dihydro-4H- 
pyirolo[3,2-e][l,4]diazepin-5-one; 5H,1 lH-Diben2o[b,f|[l,5]diazocine-6,12-dione; 1,4- 
Dihydro-10H-l,4,10-l,4,10-triaza-ben2o[a]cyclopenta(e]cyclooctene-5,ll-dione; 4H,10H-1- 
Thia-4,10-diaza-benzo[a]cyclopenta[e]cyclooctene-S,ll-dione; Dipyrrolo[l,2-c;2',r- 
e]imidazol-5-one; l,4,7,9-Tetrahydro-l,4,6,9-tetraaza-dicyclopenta[a,e]cyclooctene-S,10- 
dione; 4J,9-Trihy<bo-l-thia-4,6,9-triaza-<Mcyclopenta[a,e]cyclooctene-5,10-dione; 
2,4,9,Trihydro-llambdaM*,6HiilMa-4,9-diaza-dicyclopenta[a,e]cyclooctene-S40-dione; 6,9- 
Dihydro-5H-l-thia-5,8,9,triaza-cyclopenta[a]azulen-4-one; '3,10,Dihydro-4H- 
[l,4](iiazepino[5,6-b]mdol-5-one; 3,6-Dihydro-4H-[l,4]diazepino[6,5-b]mdol-5-one; 7,8- 
Dihydro-lH-l,7,10-triaza-cyclohepta[e]inden-6-one; 8,9-Dihydro-3H-3,6,9-triaza- 
cyclohepta[e]inden-10-one; 7,8-Dihydro-lH-l,5,8-triaza-cyclohepta[f]inden-9-one; 8,9- 
Dihydro-5,6,9,ll-tetraaza-cyclohq)t[b]naphthalene-10-one; 3,4-Dihydro-[l,4]diazepino[5,6- 
b]qumolin-5-one; 8,9-Dihydro-4,8,ll-triaza-cyclohq)ta[a]n^hthalene-7-one; llH-10,11- 
Diaza-baizo[b]fluorme; a-hydroxyacids; a-aminoacids; cohels; Bicyclo[2.2.2]octane; 2- 
Mefliylene-2,3-dihydrobaizo[l,4]dioxine; 6,7-r)ihydio-2H-pyrazino[l,2-a]pyrainidine; 9H- 
Fluorene; l,4-Dia2a-bictclo[2.2.2]octane; l-Aza-bicyclo[2.2.2]octane; Pyrido[2,3- 
d]pyrimidine; 5-Methylene-l,5-dihydro-pyrrol-2-one; Be2no[4,5]imidazo[l,2-a]pyriimdine; 
l,4-I)ihydro-benzo[4,5]iinidazo[l,2-a]pyriinidine; 4,10-Dihydro-l,4a,10-triaza-phenanthren-9- 
one; l,5-Dihydro-iniida2o[l,2-a]pyriinidin-2-one; l,2,3,5-Tetrahydro-iinidazo[l,2- 

a] pyrimidine; Thiazolo[3,2-a]thieno[2,3-d]pyriinidin-5-one; l,9-Dithia-4a,10-diaza- 
cyclopenta[b]fluoren-4-one; 5,6-Dihydro-l-thia-5,7,8,9a-tetraaza-cyclopenta[e]azulen-4-one; 
6,10-Dihydro-5H-l-thia-5,7,10a-triaza-benzo[e]azulen-4-one; 4,5-I>ihydn)-3-thia-4,5a,10- 
tria2a-cyclopenta[a]fluorine; 8H-l-Thia-cyclopenta[a]mdene; 3-Thia-4,5a,10-triaza- 
cyclopenta[a]fluorine; 6,7,9,ll-Tetrahydio-10-thia-6,9-diaza-indeno[l,2-a]a2ulene-5,8-dione; 
2,3,6,7,12a-Hexahydropyrazino[l',2':l,6]pyrido[3,4-b]indole-l,4-dione; 5,10-Dihydro-4H- 
2,3a,10-tria2a-cyclopenta[a]fluorine; 5H-Pyrido[4,3-b]mdole; 1 lH-Indolizmo[l,2-b]quinolin- 
9-one; l,2-Dihydro-2,4a,9,-triaza-anthracene-3,10-dione; 6H-Isoindolo[2,l-a]mdole; 1,5- 
Dihydro-benzo[b][l,4]diazepin-2-one; 5,10-Dihydro-dibenzo[b,e][l,4]diazepm-l 1-one; 5,1 1- 
Dihydro-benzo[e]pyrido[3,2-b] [ 1 ,4]diazepin-6-one; 4,9-Dihydro-3-thia-4,9-diaza- 
benzo[f|azulen-10-one; Benzo[g]quinoxaline; Pyrazmo[2,3-b]qumoxalme; Pyrido[2,l- 

b] quinazolin-l 1-one; l-Thia-4a,9-dia2a-cyclopenta[b]njq)hthalraie-4-one; 2-Methylene-4H- 
benzo[ 1 ,4]thiaziii-3-one. 
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110. A library according to claim 107, wherein at least 4 of said scaffolds have exactly a 
single ring. 

1 1 L A library according to claim 107, wherein at least 4 of said scaffolds have exactly two 
rings. 

112. A library according to claim 107, wherein at least 4 of said scaffolds have exactly three 
rings. 

113. A library according to claim 107, wherein at least 4 of said scaffolds have exactly four 
rings. 

114. A library according to claim 107, wherein said library includes at least 50,000 thus 
generated molecules. 

115. A library according to claim 107, wherein said Ubrary includes at least 100,000 thus 
generated molecules. 

116. A library according to claim 109, wherein said scaffolds include at least three of said 
following scaffold molecules. 

117. A library according to claim 109, wherein said scaffolds include at least ten of said 
following scaffold molecules. 

118. A library according to claim 107, wherein said generated molecules are substantially 
rigid- 

119. A library according to claim 107, wherein said molecules span a configuration space of 
spatial geometrical patterns of binding point types, mcluding at least 25% of the patterns that 
exist in protein targets. 
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120. A library according to claim 119, wherein said molecules span at least 50% of the 
patterns. 

121. A library according to claim 1 19, wherein said molecules span a space defining at least 

4 distinct binding point chemistry types. 

122. A library according to claim 1 19, wherein said molecules span a space defining at least 

5 distinct binding point chemistry types. 

123. A screening library, comprising: 

at least 100 gauge molecules generated by attaching moieties to at least one of the 
following scaffolds: Thiophene; IH-Pyrrole; Furan; Benzene; Pyridine; Pyrimidine; Pyrazine; 
6H-Thieno[2,3-b]pyrrole; l,6-Dihydro-pyrrolo[2,3-b]pyrrole; IH-Indole; Thieno[2,3- 
d]pyrimidine; 6,7-Dihydro-pyra2olo[l,5-a]pyrimidine; Quinoline; IsoquinoUne; Quinoxaline; 
3,4-Dihydro-benzo[e][l,4]dia2epin-5-one; 3,8-Dihydro-4H-pyrrolo[2,3-e][l,4]diazepin-5-one; 
3,4-Dihydro-thieno[2,3-e][l,4]dia2epin-5-one; 3,6-Dihydro-4H-pyrrolo[3,2-e][l,4]diazepin-5- 
one; 5H,llH-Dibenzo[b,£][l,5]dia2ocine-6,12-dione; l,4-Dihydro-10H-l,4,10-l,4,10-triaza- 
benzo[a]cyclopenta[e]cyclooctene-5, 1 1-dione; 4H, 1 OH-1 -Thia-4, 1 0-diaza- 

benzo[a]cyclopenta[e]cyclooctene-5,ll-dione; Dipyrrolo[l,2-c;2*,r-e]imidazol-5-one; 
1 ,4,7,9-Tetrahydro- 1 ,4,6,9-tetraaza-dicyclopenta[a,e]cyclooctene-5, 1 0-dione; 4,7,9-Trihydro- 
1 -thia-4,6,9-triaza-dicyclopenta[a,e] cyclooctene-5 , 1 0-dione; 2,4,9,Trihydro- 1 lambda*4* ,6- 
dithia-4,9-diaza-dicyclopenta[a,e]cyclooctene-5,l 0-dione; 6,9-Dihydro-5H-l-thia-5,8,9,triaza- 
cyclopenta[a]azulen-4-one; 3,10,Dihydro-4H-[l,4]diazepino[5,6-b]indol-5-one; 3,6-Dihydro- 
4H-[l,4]diazepino[6,5-b]indol-5-one; 7,8-Dihydro-lH-l,7,10-triaza-cyclohepta[e]inden-6-one; 
8,9-Dihydro-3H-3,6,9"triaza-cyclohepta[e]inden-10-one; 7,8-Dihydro-lH-l,5,8-triaza- 
cyclohepta[f|inden-9-one; 8,9-Dihydro-5,6,9,ll-tetraaza-cyclohept[b]naphthalene-10-one; 3,4- 
Dihydro-[l,4]diazepino[5,6-b]quinolin.5-one; 8,9-Dihydro-4,8,ll-triaza- 
cyclohepta[a]naphthalene-7-one; 1 lH-10,1 l-Diaza-benzo[b]fluorine; a-hydroxyacids; a- 
aminoacids; cohels; Bicyclo[2.2.2]octane; 2-Methylene-2,3-dihydrobenzo[l,4]dioxine; 6,7- 
Dihydro-2H-pyrazino[l,2-a]pyramidine; 9H-Fluorene; l,4-Diaza-bictclo[2.2.2]octane; l-Aza- 
bicyclo[2.2.2]octane; Pyrido[2,3-d]pyrimidine; 5-Methylene-l,5-dihydro-pyrTol-2-one; 
Bezno[4,5]imidazo[ 1 ,2-a]pyrimidine; 1 ,4-Dihydro-benzo[4,5]imidazo[ 1 ,2-a]pyrimidine; 4,1 0- 
Dihydro- 1 ,4a, 1 0-triaza-phenanthren-9-one; 1 ,5-Dihydro-imidazo[ 1 ,2-a]pyrimidin-2-one; 
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l,23,5-Tetrahydro-iinidazo[l,2-a]pyrimidme; TMazolo[3,2-a]thieno[23-d]pyriimdin-5-one; 

1.9- Dithia-4a,10-dia2a-cyclopenta[b]fluoren-4-one; 5,6-Dihydro-l-thia-5,7,8,9a-tetraaza- 
cyclopenta[e]azulen-4-one; 6,10-Dihydro-5H-l-thia-5,7,10a-triaza-benzo[e]azulen-4-one; 4,5- 
DihydroO-thia-4,5a,10-triaza-cyclopenta[a]fluorine; 8H-l-Thia-cyclopenta[a]indene; 3-Thia- 
4,5a, 10-tria2a-cyclopenta[a]fluorine; 6,7,9,1 l-Tetrahydro-10-thia-6,9-diaza-indeno[l,2- 

a] azulene-5,8-dione; 2,3,6,7, 1 2a-Hexahydropyra2ino[ l\2':l ,6]pyrido[3,4-b]indole- 1 ,4-dione; 

5.10- DihydK)-4H-2,3a,10-triaza-cyclopenta[a]nuorine; 5H-Pyrido[4,3-b]mdole; 1 IH- 
Indolizino[l,2-b]quinolin-9-one; l,2-Dihydro-2,4a,9,-triaza-anthracene-3,10-dione; 6H- 
Isoindolo[2,l-a]mdole; l,5-Dihydro-benzo[b][l,4]dia2epin-2-one; 5,10-Dihydro- 
dibenzo[b,e][l,4]diazepin-l 1-one; 5,1 l-Dihydro-ben2o[e]pyrido[3,2-b][l,4]diazepin-6-one; 
4,9-Dihydro-3-thia-4,9-diaza-beii2o[flazuleii-l 0-one; Benzo[g]qumoxaline; Pyrazino[2,3- 

b] quinoxaline; Pyrido[2,l-b]quinazolin-l 1-one; l-Thia-4a,9-diaza-cyclopenta[b]naphflialene- 
4-one; 2-Methylene-4H-benzo[l,4]thiazin-3-one. 

124. A library according to claim 123, wherein said molecules are generated using at least 
one of the following scaffolds: 

Thiophene; IH-Pyrrole; Furan; Benzene; Pyridine; Pyrimidine; Pyrazine; 6H-Thieno[2,3- 
b]pyrrole; l,6-Dihydro-pyrrolo[2,3-b]pyrrole; IH-Indole; Thieno[2,3-d]pyrimidine; 6,7- 
Dihydro-pyrazolo[l,5-a]pyriinidine; Quinoline; Isoquinoline; Quinoxaline; 3,4-Dihydro- 
ben2o[e][l,4]diazepin-5-one; 3,8-Dihydro-4H-pyrrolo[2,3-e][l,4]diazepin-5-one; 3,4-Dihydro- 
thieno[2,3-e][l,4]diazepin-5-one; 3,6-Dihydro-4H-pynolo[3,2.e][l,4]diazepm.5-one; 5H,1 IH- 
Dibenzo[b,f] [ 1 , 5]diazocine-6,12.dione; 1 ,4-Dihydro- 1 OH- 1 ,4, 1 0- 1 ,4, 1 0-triaza- 

benzo[a]cyclopenta[e] cyclooctene-5, 1 1 -dione; 4H, 1 OH- 1 -Thia-4, 1 0-diaza- 

benzo[a]cyclopenta[e]cyclooctene-5, 1 1-dione; Dipyrrolo[l,2-c;2*,r-e]iimdazol-5-one. 

125. A library according to claim 123, wherein said at least 100 molecules comprise at least 
300 molecules. 

126. A library according to claim 123, wherein said at least 100 molecules of said library are 
generated using a single one of said scaffolds. 

127. A screening library comprising a set of at least 10,000 substantially rigid molecules. 
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128. A library according to claim 127, wherein said set comprises at least 50,000 
substantially rigid molecules. 

129. A library according to claim 127, wherein said set comprises at least 100,000 
5 substantially rigid molecules. 

130. A library according to claim 127, wherein said set is selected to have a an expected 
binding rate of at least 0. 1% of the library for protein targets in general. 

10 131. A library according to claim 1 30, whorein said expected binding rate is at least 0.5%. 

.132. A library according to claim 130, wherein said set is designed to provide molecules 
with a uniformity of hit probability for a generalized target of within a ratio of 1:100 for the 
whole set. 

15 

133. A library according to claim 132, wherein said ratio is within 1:10. 

134. A library according to claim 127, wherein said set spans a space of spatial chemical 
configurations, each such configuration defining a certain plurality of binding points having 

20 distances between them, the set covering substantially all possible configurations in the space 
in a given range of distances. 

135. A screening library, comprising: 

a plurality of at least 5,000 gauge molecules, each such molecule defining at least one 
25 spatial configuration of binding type points, 

wherein substantially each point in a space of such configurations is covered by at least 
two different gauge molecules. 

136. A library according to claim 135, wherein each point is covered by at least two 
30 substantially identical spatial configurations. 

137. A library according to claim 135, wherein each point is covered by at least two 
substantially different spatial configurations. 
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138. A library according to claim 135, wherein said space is a space of triangles defined by 
binding type at vertexes and distances between vertexes. 

S 139. A library according to claim 138, wherein said space includes distances of between 4 A 



140. A library according to claim 138, wherein said space includes distances of between 2 A 
and 10 A. 

10 

141. A library according to claim 138, wherein said space includes at least 5 different 
binding types. 

142. A library according to claim 138, wherein said space includes at least 7 different 
15 binding types. 

143. A libraiy according to claim 138, wherein said space includes onmi-directional binding 
types. 

20 144. A library according to claim 138, wherein said space includes directional binding types. 

145. A library according to claim 138, wherein said substantially each point in said space is 
covered by at least three gauges. 

25 146. A library according to claim 138, wherein substantially all the gauges include a 
plurality of configurations of said space. 

147. A method of obtaining information about a binding behavior of a target molecule, 
comprising: 

30 providing a set of substantially rigid chemical gauges, a significant nimiber of said 

gauges being expected to bind with said target; 



and 8 A (angstrom = lO'l® meters). 



reacting said target with a plurality of gauges of said set of gauges; and 
physically analyzing a structure of said target boimd to a gauge. 
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148. A method according to claim 147, wherein physically analyzing comprises analyzing 
using NMR. 

5 149. A method according to claim 147» wherein physically analyzing comprises analyzing 
using X-ray crystallography. 

150. A method according to claim 147» wherein physically analyzing comprises analyzing 
using binding with a set of gauges. 

10 

151. A method according to claim 147, comprising virtually super-imposing a plurality of 
stmctures obtained by said physically analyzing. 

152. A method of constructing a lead, comprising: 

1 5 providing a set of substantially rigid chemical gauges; 

reacting said target with a pliirality of gauges of said set of gauges; 

assaying a binding of said gauges with said target to obtain a plurality of assay results; 

and 

constmcting a lead based on said assay results. 



153. A method according to claim 152, wherein constmcting a lead comprises Imking 
together a plurality of gauges foxmd to bind in said assaying. 

154. A method according to claim 152, wherein constmcting a lead comprises modifying an 
25 existing molecule to have moieties that correspond to binding locations found by said 



assaying. 
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